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ATGGGCCGGGAGCTCCGCTTCGCCCACCTCCACCAGCACA 

CCCAGTTCTCCCTCCTGGAGGGGGCGGCGAAGCTTTCCGA 

CCTCCTCAAGTGGGTCAAGGAGACGACCCCCGAGGACCCC 

GCCTTGGCCATGACCGACCACGGCAACCTCTTCGGGGCCG 

TGGAGTTCTACAAGAAGGCCACCGAAATGGGCATCAAGCC 

CATCCTGGGCTACGAGGCCTACGTGGCGGCGGAAAGCCGC 

TTTGACCGCAAGCGGGGAAAGGGCCTAGACGGGGGCTACT 

TTCACCTCACCCTCCTCGCCAAGGACTTCACGGGGTACCA* 

GAACCTGGTGCGCCTGGCGAGCCGGGCTTACCTGGAGGGG 

TTTTACGAAAAGCCCCGGATTGACCGGGAGATCCTGCGCG 

AGCACGCCGAGGGCCTCATCGCCCTCTCGGGGTGCCTCGG 

GGCGGAGATCCCCCAGTTCATCCTCCAGGACCGTCTGGAC 

CTGGCCGAGGCCCGGCTCAACGAGTACCTCTCCATCTTCA 

AGGACCGCTTCTTCATCGAGATCCAGAACCACGGCCTCCC 

CGAGCAGAAAAAGGTCAACGAGGTCCTCAAGGAGTTCGCC 

CGAAAGTACGGCCTGGGGATGGTGGCCACCAACGACGGCC 

ATTACGTGAGGAAGGAGGACGCCCGCGCCCACGAGGTCCT 

CCTCGCCATCCAGTCCAAGAGCACCCTGGACGACCCCGGG 

CGCTGGCGCTTCCCCTGCGACGAGTTCTACGTGAAGACCC 

CCGAGGAGATGCGGGCCATGTTCCCCGAGGAGGAGTGGGG 

GGACGAGCCCTTTGACAACACCGTGGAGATCGCCCGCATG 

TGCAACGTGGAGCTGCCCATCGGGGACAAGATGGTCTACC 

GAATCCCCCGCTTCCCCCTCCCCGAGGGGCGGACCGAGGC 

CCAGTACCTCATGGAGCTCACCTTCAAGGGGCTCCTCCGC 

CGCTACCCGGACCGGATCACCGAGGGCTTCTACCGGGAGG 

TCTTCCGCCTTTTGGGGAAGCTTCCCCCCCACGGGGACGG 

GGAGGCCTTGGCCGAGGCCTTGGCCCAGGTGGAGCGGGAG 

GCTTGGGAGAGGCTCATGAAGAGCCTCCCCCCTTTGGCCG 

GGGTCAAGGAGTGGACGGCGGAGGCCATTTTCCACCGGGC 

CCTTTACGAGCTTTCCGTGATAGAGCGCATGGGGTTTCCC 

GGCTACTTCCTCATCGTCCAGGACTACATCAACTGGGCCC 

GGAGAAACGGCGTCTCCGTGGGGCCCGGCAGGGGGAGCGC 

CGCCGGGAGCCTGGTGGCCTACGCCGTGGGGATCACCAAC 

ATTGACCCCCTCCGCTTCGGCCTCCTCTTTGAGCGCTTCC 

TGAACCCGGAGAGGGTCTCCATGCCCGACATTGACACGGA 

CTTCTCCGACCGGGAGCGGGACCGGGTGATCCAGTACGTG 

CGGGAGCGCTACGGCGAGGACAAGGTGGCCCAGATCGGCA 

CCCTGGGAAGCCTCGCCTCCAAGGCCGCCCTCAAGGACGT 

GGCCCGGGTCTACGGCATCCCCCACAAGAAGGCGGAGGAA 

TTGGCCAAGCTCATCCCGGTGCAGTTCGGGAAGCCCAAGC 

CCCTGCAGGAGGCCATCCAGGTGGTGCCGGAGCTTAGGGC 

GGAGATGGAGAAGGACCCCAAGGTGCGGGAGGTCCTCGAG 

GTGGCCATGCGCCTGGAGGGCCTGAACCGCCACGCCTCCG 

TCCACGCCGCCGGGGTGGTGATCGCCGCCGAGCCCCTCAC 

GGACCTCG.TCCCCCTCATGCGCGACCAGGAAGGGCGGCCC 

GTCACCCAGTACGACATGGGGGCGGTGGAGGCCTTGGGGC 

TTTTGAAGATGGACTTTTTGGGCCTCCGCACCCTCACCTT 



FIG. 16A 



CCTGGACGAGGTCAAGCGCATCGTCAAGGCGTCCCAGGGG ■ ' f • 1920 

GTGGAGCTGGACTACGATGCCCTCCCCCTGGACGACCCCff ' " * ? . . 

AGACCTTCGCCCTCCTCTCCCGGGGGGAGACCAAGGGGGT 

CTTCCAGCTGGAGTCGGGGGGGATGACCGCCACGCTCCGC 204 0 

GGCCTCAAGCCGCGGCGCTTTGAGGACCTGATCGCCATCC 

TCTCCCTCTACCGCCCCGGGCCCATGGAGCACATCCCCAC 

CTACATCCGCCGCCACCACGGGCTGGAGCCCGTGAGCTAC 2160 

AGCGAGTTTCCCCACGCCGAGAAGTACCTAAAGCCCATCC 

TGGACGAGACCTACGGCATCCCCGTCTACCAGGAGCAGAT" 

CATGCAGATCGCCTCGGCCGTGGCGGGGTACTCCCTGGGC . „. . 2280 

GAGGCGGACCTCCTGCGGCGGTCCATGGGCAAGAAGAAGG 

TGGAGGAGATGAAGTCCCACCGGGAGCGCTTCGTCCAGGG 

GGCCAAGGAAAGGGGCGTGCCCGAGGAGGAGGCCAACCGC 24 00 

CTCTTTGACATGCTGGAGGCCTTCGCCAACTACGGCTTCA 

ACAAATCCCACGCTGCCGCCTACAGCCTCCTCTCCTACCA 

GACCGCCTACGTGAAGGCCCACTACCCCGTGGAGTTCATG 2520 

GCCGCCCTCCTCTCCGTGGAGCGGCACGACTCCGAC7^AGG - 

TGGCCGAGTACATCCGCGACGCCCGGGCCATGGGCATAGA 

GGTCCTTCCCCCGGACGTCAACCGCTCCGGGTTTGACTTC 2 64 0 

CTGGTCCAGGGCCGGCAGATCCTTTTCGGCCTCTCCGCGG 

TGAAGAACGTGGGCGAGGCGGCGGCGGAGGCCATTCTCCG 

GGAGCGGGAGCGGGGCGGCCCCTACCGGAGCCTCGGCGAC 27 60 

TTCCTCAAGCGGCTGGACGAGAAGGTGCTCAACAAGCGGA 

CCCTGGAGTCCCTCATCAAGGCGGGCGCCCTGGACGGCTT 

CGGGGAAAGGGCGCGGCTCCTCGCCTCCCTGGAAGGGCTC 28 80 

CTCAAGTGGGCGGCCGAGAACCGGGAGAAGGCCCGCTCGG 

GCATGATGGGCCTCTTCAGCGAAGTGGAGGAGCCGCCTTT 

GGCCGAGGCCGCCCCCCTGGACGAGATCACCCGGCTCCGC 3000 

TACGAGAAGGAGGCCCTGGGGATCTACGTCTCCGGCCACC 

CCATCTTGCGGTACCCCGGGCTCCGGGAGACGGCCACCTG 

CACCCTGGAGGAGCTTCCCCACCTGGCCCGGGACCTGCCG 3120 

CCCCGGTCTAGGGTCCTCCTTGCCGGGATGGTGGAGGAGG 

TGGTGCGCAAGCCCACAAAGAGCGGCGGGATGATGGCCCG 

CTTCGTCCTCTCCGACGAGACGGGGGCGCTTGAGGCGGTG 324 0 

GCATTCGGCCGGGCCTACGACCAGGTCTCCCCGAGGCTCA 

AGGAGGACACCCCCGTGCTCGTCCTCGCCGAGGTGGAGCG 

GGAGGAGGGGGGCGTGCGGGTGCTGGCCCAGGCCGTTTGG 3360 

ACCTACGAGGAGCTGGAGCAGGTCCCCCGGGCCCTCGAGG 

TGGAGGTGGAGGCCTCCCTCCTGGACGACCGGGGGGTGGC 

CCACCTGAAAAGCCTCCTGGACGAGCACGCGGGGACCCTC 34 8 0 

CCCCTGTACGTCCGGGTCCAGGGCGCCTTCGGCGAGGCCC 

TCCTCGCCCTGAGGGAGGTGCGGGTGGGGGAGGAGGCTGT 

AGGCGGCCGCGTGGTTCCGGGCCTACCTCCTGCCCGACCG 3600 

GGAGGTCCTTCTCCAGGGCGGCCAGGCGGGGGAGGCCCAG 

GAGGCGGTGCCCTTCTAGGGGGTGGGCCGTGAGACCTAGC 

GCCATCGTTCTCGCCGGGGGCAAGGAGGCCTGGGCCCGAC 3720 

CCCTTTTGG 



FIG. 16B 



MGRELRFAHLHQHTQFSLLDGAPKLSDLLKWVEETTPEDP 

ALAMT DHGNL FGAVE FYKKATEMG I KP I LG YEAYVAAESR ■ 

FDRKRGKGLDGGYFHLTLLAKDFTGYQNLVRLASRAYLEG ' " 

FYEKPRIDREILREHAEGLIALSGCLGAEIPQFILQDRLD 

LAEARLNEYLSIFKDRFFIEIQNHGLPEQKKVNEVLKEFA 

RKYGLGMVATNDGHYVRKEDARAHEVLLAIQSKSTLDDPG 

ALALPCEEFYVKTPEEMRAMFPEEEVGGRSPLTTPWRSPH 

VQRGAAIGTRWSTRIPRFPLPEGRTEAQYLMELTFKGLLR 

RYPDRITEG F Y RE V FRL S GKL P PH G DGE AL AE AL AQ VERE 

AWERLMKSLPPLAGVKEWTAEAIFHRALYELSAIERMGFP 

GLLPHRPGLHQLGPEKGVSVGPGRGGAAGSLVAYAVGITN 

IDPLRFGLLFERFLNPERVSMPDIDTDFSDRERDRVIQYV 

RERYGEDKVAQIGTLGSLASKAALKEVARVYGIPRKKAEE 

LAKLIPVQFGKPKPLQEAIQWPELRAEMEKDPKVREVLE 

VAMRLEGLNRHASVHAGRGGVFSEPLTDLVPLCATRKGGP 

YTQYDMGAVEALGLLKMDFLGLRTLTFLDEVKRIVKASQG 

VELDYDALPLDDPKTFALLSRGETKGVFQLESGGMTATLR 

GLKPRRFEDLIAILSLYRPGPMEHIPTYIRRHHGLEPVSY 

SEFPHAEKYLKPILDETYGIPVYQEQIMQIASAVAGYSLG 

EADLLRRSMGKKKVEEMKSHRERFVQGAKERGVPEEEANR 

LFDMLEAFANYGFNKSHAAAYSLLSYQTAYVKAHYPVEFM 

AALLSVERHDSDKVAEYIRDARAMGIEVLPPDVNRSGFDF 

LVQGRQILFGLSAVKNVGEAAAEAILRERERGGPYRSLGD 

FLKRLDEKVLNKRTLESLIKAGALDGFGERARLLASLEGL 

LKWAAENREKARSGMMGLFSEVEEPPLAEAAPLDEITRLR 

YEKEALGI YVSGHPILRYPGLRETATCTLEELPHLARDLP 

PRSRVLLAGMVEEWRKPTKSGGMMARFVLSDETGALEAV 

AFGRAYDQVSPRLKEDTPVLVLAEVEREEGGVRVLAQAVW 

TYQELEQVPRALEVEVEASLPDDRGVAHLKSLLDEHAGTL 

PLYVRVQGAFGEALLALREVRVGEEALGALEAAGFPAYLL 

PNREVSPRLTGSGGPRGRALSTGLALKTYPIALPGGNEAL 

ARPLL 
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FIG.18A 



ATGGTGGAGCGGGTGGTGCGGACCCTTCTGGACGGGAGGT 4 0 
TCCTCCTGGAGGAGGGGGTGGGGCTTTGGGAGTGGCGCTA 
CCCCTTTCCCCTGGAGGGGGAGGCGGTGGTGGTCCTGGAC 120 
CTGGAGACCACGGGGCTTGCCGGCCTGGACGAGGTGATTG : . 
AGGTGGGCCTCCTCCGCCTGGAGGGGGGGAGGCGCCTCCC 200 
CTTCCAGAGCCTGGTCCGGCCCCTCCCGCCCGCCGAAGCC 
CGTTCGTGGAACCTCACCGGCATCCCCCGGGAGGCCCTGG 28 0 
AGGAGGCCCCCTCCCTGGAGGAGGTTCTGGAGAAGGCCTA 
CCCCCTCCGCGGCGACGCCAGCTTGGTGATCCACAACGCC 360 
GCCTTTGACCTGGGCTTCCTCCGCCCGGCCTTGGAGGGCC 
- TGGGCTACCGCCTGGAAAACCCCGTGGTGGACTCCCTGCG 440 
CTTGGCCAGACGGGGCTTACCAGGCCTTAGGCGCTACGGC 
CTGGACGCCCTCTCGGAGGTGC TGGAGC TTGC CCG AAGG A 520 
CCTGCCACCGGGCCCTCGAGGACGTGGAGCGCACCCTCGC 
CGTGGTGCACGAGGTATACTATATGCTTACGTCCGGCCGT 600 
CCCCGCACGCTTTGGGAACTCGGGAGGTAG 



MVERWRTLLDGRFLLEEGVGLWEWRYPFPLEGEAVWLD 40 
LETTGLAGLDEVIEVGLLRLEGGRRLPFQSLVRPLPPAEA 

RSWNLTG I PREALEEAP SLEEVLEKAYPLRGDATLVI HNA 12 0 
AFDLGFLRPALEGLGYRLENPWDSLRLARRGLPGLRRYG 

LDALSEVLELPRRTCHRALEDVERTLAWHEVYYMLTSGR 200 
PRTLWELGRZ 
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GTGTCGCACGAGGCCGTCTGGCAACACGTTCTGGAGCAGA 
TCCGCCGCAGCATCACCGAGGTGGAGTTCCACACCTGGTT 
TGAAAGGATCCGCCCCTTGGGGATCCGGGACGGGGTGCTG 12 0 
GAGCTCGCCGTGCCCACCTCCTTTGCCCTGGACTGGATCC 

ggcgccactacgccGgcctcatccaggagggccctcggct 

CCTCGGGGCCCAGGCGCCCCGGTTTGAGCTCCGGGTGGTG 240 

CCCGGGGTCGTAGTCCAGGAGGACATCTTCCAGCCCCCGC 

CGAGCCCCCCGGCCCAAGCTCAACCCGAAGATACCTTTAA 

AACTTCGTGGTGGGGCCCAACAACTCCATGGCCCCACGGC 3 60 

GGCGCCGTGGCCGTGGCCGAGTCCCCCGGCCGGGCCTACA 

ACCCCCTCTTCATCTACGGGGGCCGTGGCCTGGGAAAGAC 

CTACCTGATGCACGCCGTGGGCCCACTCCGTGCGAAGCGC 480 

TTCCCCCACATGAGATTAGAGTACGTTTCCACGGAAACTT 

TCACCAACGAGCTCATCAACCGGCCATCGGCGAGGGACCG 

GATGACGGAGTTCCGGGAGCGGTACCGCTCCGTGGACCTC 600 

CTGCTGGTGGACGACGTCCAGTTCATCGCCGGAAAGGAGC 

GCACCCAGGAGGAGTTTTTCCACACCTTCAACGCCCTTTA 

CGAGGCCCACAAGCAGATCATCCTCTCCTCCGACCGGCCG 72 0 

cccaaggacatcctcaccctggaggcggGcctgcggagcc 

gctttgagtggggcctgatcaccgacaatccagcccccga 

cctggaaacccggatcgccatgctgaagatgaacggcagc 840 

agcgggcctgaggatcccgaggacgccctggagtacatcg 

cccgGcaggtcacctccaacatccgggagtgggaaggggc 

cctcatgcgggcatcgcctttcgcctccctcaacggcgtt 9 60 

gagctgacccgcgccgtggcggccaaggctctccgacatc 

ttggccccagggagctggaggcggaccccttggagatcat 

gcgcaaagcggcgggaccagttcggcctgaaaccccggga 1 08 0 

ggagctcacggggagcgccgcaagaaggaggtggtcctcc 

cccggcagctcgccatgtacctggtgcgggagctcacccc 

ggcctccctgcccgagatcgaccagctcaacgacgaccgg 12 00 

gaccacaccacggtcctctacgccatccagaaggtccagg 

agctcgcggaaagcgaccgggaggtgcaggggctcctccg 

c acgctccgggaggcgtgc ac atga 



FIG.20A 



VSHE AVWQHVLEH I RRS I TEVEFHTWFERI RP LG I RDGVL 

ELAVPTSFALDWIRRHYAGLIQEGPRLLGAQAPRFELRVV 
PGVWQEDIFQPPPSPPAQAQPEDTFKTSWWGPTTPWPHG 120 
GAVAVAESPGRAYNPLFIYGGRGLGKTYLMHAVGPLRAKR 
FPHMRLEYVSTETFTNELINRPSARDRMTEFRERYRSVDL 
LLVDDVQF I AGKERTQEEFFHTFNALYEAHKQ 1 1 L S SDRP 240 
PKDILTLEARLRSRFEWGLITDNPAPDLETRIAILKMNAS 
SGPEDP EDALEY I ARQVTSNI REWEGALMRAS PF AS LNGV 
ELTRAVAAKALRHLRPRELEADPIiEIIRKAAGPVRPETPG 3 60 
GAHGERRKKEWLPRQLAMYLVRELTPASIjPEIDQLNDDR 
DHTTVLYAIQKVQELAESDREVQGLLRTLREACT 



FIG.20B 



ATGAACATAACGGTTCCCAAAAAACTCCTCTCGGACCAGC 4 0 
TTTCCCTCCTGGAGCGCATCGTCGCCTCTAGAAGCGCCAA 
CCCCCTCTACACCTACCTGGGGCTTTACGCCGAGGAAGGG 12 0 
GCCTTGATCCTCTTCGGGACCAACGGGGAGGTGGACCTCG 
AGGTCCGCCTCCCCGCCGAGGCCCAAAGCCTTCCCCGGGT 2 00 
GCTCGTCCCCGCCCAGCCCTTCTTCCAGCTGGTGCGGAGC 
CTTCCTGGGGACCTCGTGGCCCTCGGCCTCGCCTCGGAGC 28 0 
CGGGCCAGGGGGGGCAGGTGGAGCTCTCCTCCGGGCGTTT 
CCGCACCCGGCTCAGCCTGGCCCCTGCCGAGGGCTACCCC 3 6 0 
GAGCTTCTGGTGCCCGAGGGGGAGGACAAGGGGGCCTTCC 
CCCTCCGGACGCGGATGCCCTCCGGGGAGCTCGTCAAGGC 44 0 
CTTGACCCACGTGCGCTACGCCGCGAGCAACGAGGAGTAC 
CGGGCCATCTTCCGCGGGGTGCAGCTGGAGTTCTCCCCCC 52 0 
AGGGCTTCCGGGCGGTGGCCTCCGACGGGTACCGCCTCGC 
CCTCTACGACCTGCCCCTGCCCCAAGGGTTCCAGGCCAAG 60 0 
GCCGTGGTCCCCGCCCGGAGCGTGGACGAGATGGTGCGGG 
TCCTGAAGGGGGCGGACGGGGCCGAGGCCGTCCTCGCGCT 68 0 
GGGCGAGGGGGTGTTGGCCCTGGCCCTCGAGGGCGGAAGC 
GGGGTCCGGATGGCCCTCCGCCTCATGGAAGGGGAGTTCC 760 
CCGACTAGCAGAGGGTCATCCCCCAGGAGTTCGCCCTCAA 
GGTCCAGGTGGAGGGGGAGGCCCTCAGGGAGGCGGTGCGC 840 
CGGGTGAGGGTC CTCTC CGAC GGGC AG AACC ACCGGGTGG - 

ACCTCCTTTTGGAGGAAGGCCGGATCCTCCTCTCGGCCGA 92 0 
GGGGGACTACGGCAAGGGGCAGGAGGAGGTGCGCGCCCAG 
GTGGAGGGGCCGGACATGGCCGTGGCCTACAACGCCCGCT 1000 
ACCTCCTCGAGGCCCTCGCCCCCGTGGGGGACCGGGCCCA 
CCTGGGCATCTCCGGGCCCACGAGCCCGAGCCTCATCTGG 10 80 
GGGGACGGGGAGGGGTACCGGGCGGTGGTGGTGCCCCTCA 
GGGTCTAG n 28 

FIG.21A 



MNITVPKKLLSDQLSLLERIVPSRSANPLYTYLGLYAEEG 40 
ALILFGTNGEVDLEVRLPAEAQSLPRVLVPAQPFFQLVRS 
LPGDLVALGLASEPGQGGQLELSSGRFRTRLSLAPAEGYP 12 0 
- ELLVPEGEDKGAFPLRTRMPSGELVKALTHVRYAASNEEY 

RAI FRGVQLEFS PQGFRAVASDGYRLALYDLPLPQGFQAK 2 00 
AWP AR S VDEMVRVLKG AEX3 AEAVLALGEGVLAIiAIjEGG S. 
GVRMALRLMEGEF PDYQRVI PQEFALKVQVEGEALREAVR 2 80 
RVSVLSDRQNHRVDLLLEEGRILLSAEGDYGKGQEEVPAQ 
VEGPDMAVAYNARYLLEALAPVGDRAHLGI SGPTS PSLI W 3 60 
GDGEGYRAVWPLRVZ 



FIG.21B 
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FIG . 24 A Induction 




P 



I 

Lysis 

I 

Heat Step 

I 

Fl G . 24 B MonoQ Column 
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FIG. 33A 
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FIG. 33E 



Temperature (°C) 



ATGAGTAAGGATTTCGTCCACCTTCACCTGCACACCCAGTTCTCACTCCT 
GGACGGGGCTATAAAGATAGACGAGCTCGTGAAAAAGGCAAAGGAGTATG 100 
GATACAAAGCTGTCGGAATGTCAGACCACGGAAACCTCTTCGGTTCGTAT 
AAATTCTACAAAGCCCTGAAGGCGGAAGGAATTAAGCCCATAATCGGCAT 200 
GGAAGCCTACTTTACCACGGGTTCGAGGTTTGACAGAAAGACTAAAACGA 
GCGAGGACAACATAACCGACAAGTACAACCACCACCTCATACTTATAGCA 300 
AAGGACGAAAAGGTCTAAAGAACTTAATGAAGCTCTCAACCCTCGCCTAC 
AAAGAAGGTTTTTACTACAAACCCAGAATTGATTAQGAACTCCTTGAAAA 4 00 
GTACGGGGAGGGCCTAATAGCCCTTACCGCATGCCTGAAAGGTGTTCCCA 
CCTACTACGCTTCTATAAACGAAGTGAAAAAGGCGGAGGAATGGGTAAAG 500 
AAGT T C AAGG AT AT AT T C G GAG AT G AC C T T T AT T T AG AAC T T C AAG C G AA 
CAACATTCCAGAACAGGAAGTGGCAAACAGGAACTTAATAGAGATAGCCA 600 
AAAAGTACGATGTGAAACTCATAGCGACGCAGGACGCCCACTACCTCAAT 
CCCGAAGACAGGTACGCCCACACGGTTCTTATGGCACTTCAAATGAAAAA 7 00 
GACCATTCACGAACTGAGTTCGGGAAACTTCAAGTGTTCAAACGAAGACC 
TTCACTTTGCTCCACCCGAGTACATGTGGAAAAAGTTTGAAGGTAAGTTC 8 00 
GAAGGCTGGGAAAAGGCACTCCTGAACACTCTCGAGGTAATGGAAAAGAC 
AGCGGACAGCTTTGAGATATTTGAAAACTCCACCTACCTCCTTCCCAAGT 900 
ACGACGTTCCGCCCGACAAAACCCTTGAGGAATACCTCAGAGAACTCGCG 
TACAAAGGTTTAAGACAGAGGATAGAAAGGGGACAAGCTAAGGATACTAA 1000 
AGAGTACTGGGAGAGGCTCGAGTACGAACTGGAAGTTATAAACAAAATGG 
GCTTTGCGGGATACTTCTTGATAGTTCAGGACTTCATAAACTGGGCTAAG 1100 
AAAAACGACATACCTGTTGGACCCGGAAGGGGAAGTGCTGGAGGTTCCCT 
CGTCGCATACGCCATCGGAATAACGGACGTTGACCCTATAAAGCACGGAT 1200 
TCCTTTTTGAGAGGTTCTTAAACCCCGAAAGGGTTTCCATGCCGGATATA 
GACGTGGATTTCTGTCAGGACAACAGGGAAAAGGTCATAGAGTACGTAAG 1300 
GAACAAGTACGGACACGACAACGTAGCTCAGATAATCACCTACAACGTAA 
TGAAGGCGAAGCAAACACTGAGAGACGTCGCAAGGGCCATGGGACTCCCC 14 00 
TACTCCACCGCGGACAAACTCGCAAAACTCATTCCTCAGGGGGACGTTCA 
GGGAACGTGGCTCAGTCTGGAAGAGATGTACAAAACGCCTGTGGAGGAAC 1500 
TCCTTCAGAAGTACGGAGAACACAGAACGGACATAGAGGACAACGTAAAG 
AAGTTCAGACAGATATGCGAAGAAAGTCCGGAGATAAAACAGCTCGTTGA 1600 
GACGGCCCTGAAGCTTGAAGGTCTCACGAGACACACCTCCCTCCACGCCG 
CGGGAGTGGTTATAGCACCAAAGCCCTTGAGCGAGCTCGTTCCCCTCTAC 17 00 
TACGATAAAGAGGGCGAAGTCGCAACCCAGTACGACATGGTTCAGCTCGA 
AGAACTCGGTCTCCTGAAGATGGACTTCCTCGGACTCAAAACCCTCACAG 18 00 
AAC T G AAAC T C AT G AAAG AAC T C AT AAAG G AAAG AC AC G GAG T G GAT AT A 
AACTTCCTTGAACTTCCCCTTGACGACCCGAAAGTTTACAAACTCCTTCA 1900 
GGAAGGAAAAACCACGGGAGTGTTCCAGCTCGAAAGCAGGGGAATGAAAG 
AACTCCTGAAGAAACTAAAGCCCGACAGCTTTGACGACATCGTTGCGGTC 2000 
CTCGCACTCTACAGACCCGGACCTCTAAAGAGCGGACTCGTTGACACATA 
CATTAAGAGAAAGCACGGAAAAGAACCCGTTGAGTACCCCTTCCCGGAGC 2100 
TTGAACCCGTCCTTAAGGAAACCTACGGAGTAATCGTTTATCAGGAACAG 
GTGATGAAGATGTCTCAGATACTTTCCGGCTTTACTCCCGGAGAGGCGGA 2200 
TACCCTCAGAAAGGCGATAGGTAAGAAGAAAGCGGATTTAATGGCTCAGA 
TGAAAGACAAGTTCATACAGGGAGCGGTGGAAAGGGGATACCCTGAAGAA 2300 
AAGATAAGGAAGCTCTGGGAAGACATAGAGAAGTTCGCTTCCTACTCCTT 
CAACAAGTCTCACTCGGTAGCTTACGGGTACATCTCCTACTGGACCGCCT 24 00 



FIG. 34A 



ACGTTAAAGCCCACTATCCCGCGGAGTTCTTCGCGGTAAAACTCACAACT 

G AAAAG AAC G AC AAC AAG T T C C T C AAC C T C AT AAAAG AC G C T AAAC T C T T 2 5,0 0 

CGGATTTGAGATACTTCCCCCCGACATAAACAAGAGTGATGTAGGATTTA 

CGATAGAAGGTGAAAACAGGATAAGGTTCGGGCTTGCGAGGATAAAGGGA 2 600 

GTGGGAGAGGAAACTGCTAAGATAATCGTTGAAGCTAGAAAGAAGTATAA 

GCAGTTCAAAGGGCTTGCGGACTTCATAAACAAAACCAAGAACAGGAAGA 27 00 

TAAACAAGAAAGTCGTGGAAGCACTCGTAAAGGCAGGGGCTT.TTGACTTT 

ACTAAGAAAAAGAGGAAAGAACTACTCGCTAAAGTGGCAAACTCTGAAAA 28 00 

AGCATTAATGGCTACACAAAACTCCCTTTTCGGTGCACCGAAAGAAGAAG 

TGGAAGAACTCGACCCCTTAAAGCTTGAAAAGGAAGTTCTCGGTTTTTAC 2 900 

ATTTCAGGGCACCCCCTTGACAACTACGAAAAGCTCCTCAAGAACCGCTA 

CACACCCATTGAAGATTTAGAAGAGTGGGACAAGGAAAGCGAAGCGGTGC 3000 

TTACAGGAGTTATCACGGAACTCAAAGTAAAAAAGACGAAAAACGGAGAT 

TACATGGCGGTCTTCAACCTCGTTGACAAGACGGGACTAATAGAGTGTGT 3100 

CGTCTTCCCGGGAGTTTACGAAGAGGCAAAGGAACTGATAGAAGAGGACA 

GAGTAGTGGTAGTCAAAGGTTTTCTGGACGAGGACCTTGAAACGGAAAAT 3200 

GTCAAGTTCGTGGTGAAAGAGGTTTTCTCCCCTGAGGAGTTCGCAAAGGA 

GATGAGGAATACCCTTTATATATTCTTAAAAAGAGAGCAAGCCCTAAACG 3300 

GCGTTGCCGAAAAACTAAAGGGAATTATTGAAAACAACAGGACGGAGGAC 

GGATACAACTTGGTTCTCACGGTTGATCTGGGAGACTACTTCGTTGATTT 34 00 

AGCACTCCCACAAGATATGAAACTAAAGGCTGACAGAAAGGTTGTAGAGG 

AGATAGAAAAACTGGGAGTGAAGGTCATAATTTAGTAAATAACCCTTACT 3500 

TCCGAGTAGTCCCC 
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MSKDFVHLHLHTQFSLLDGAIKIDELVKKAKEYGYKAVGMSDHGNLFGSY 
KFYKALKAEGIKPI IGMEAYFTTGSRFDRKTKTSEDNITDKYNHHLILIA 100 
KDDKGLKNLMKLSTLAYKEGFYYKPRIDYELLEKYGEGLIALTACLKGVP 
TYYASINEVKKAEEWVKKFKDIFGDDLYLELQANNI PEQEVANRNLIEIA 200 
KKYDVKLIATQDAHYLNPEDRYAHTVLMALQMKKTIHELSSGNFKCSNED 
LHFAPPEYMWKKFEGKFEGWEKALLNTLEVMEKTADSFEIFENSTYLLPK • 300 
YDVPPDKTLEEYLRELAYKGLRQRIERGQAKDTKEYWERLEYELEVINKM 
GFAGYFLIVQDFINWAKKNDIPVGPGRGSAGGSLVAYAIGITDVDPIKHG 4 00 
FLFERFLNPERVSMPDIDVDFCQDNREKVIEYVRNKYGHDNVAQIITYNV 
MKAKQTLRDVARAMGLPYSTADKLAKLIPQGDVQGTWLSLEEMYKTPVEE 500 
LLQKYGEHRTDIEDNVKKFRQICEESPEIKQLVETALKLEGLTRHTSLHA 
AGVVIAPKPLSELVPLYYDKEGEVATQYDMVQLEELGLLKMDFLGLKTLT 600 
ELKLMKELIKERHGVDINFLELPLDDPKVYKLLQEGKTTGVFQLESRGMK 
ELLKKLKPDSFDDIVAVLALYRPGPLKSGLVDTYIKRKHGKEPVEYPFPE 7 00 
LEPVLKETYGVIVYQEQVMKMSQILSGFTPGEADTLRKAIGKKKADLMAQ 
MKDKFIQGAVERGYPEEKIRKLWEDIEKFASYSFNKSHSVAYGYISYWTA 8 00 
YVKAHYPAEFFAVKLTTEKNDNKFLNLIKDAKLFGFEILPPDINKSDVGF 
TIEGENRIRFGLARIKGVGEETAKIIVEARKKYKQFKGLADFINKTKNRK 900 
I N KK V VE AL VKAG A F D FT KKKRKE LL AK VAN S E KALMAT QN S L FG A P KE E 
VEELDPLKLEKEVLGFYISGHPLDNYEKLLKNRYTPIEDLEEWDKESEAV 1000 
LTGVITELKVKKTKNGDYMAVFNLVDKTGLIECWFPGVYEEAKELIEED 
RWWKGFLDEDLETENVKFWKEVFSPEEFAKEMRNTLYIFLKREQALN 1100 
GVAEKLKGIIENNRTEDGYNLVLTVDLGDYFVDLALPQDMKLKADRKVVE 
E I EKLGVKVI I 1161 
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ATGAACTACGTTCCCTTCGCGAGAAAGTACAGACCGAAATTCTTCAGGGA 
AGTAATAGGACAGGAAGCTCCCGTAAGGATACTCAAAAACGCTATAAAAA 100 
ACGACAGAGTGGCTCACGCCTACCTCTTTGCCGGACCGAGGGGGGTTGQG 
AAGACGACTATTGCAAGAATTCTCGCAAAAGCTTTGAACTGTAAAAATCC 20.0 
CTCCAAAGGTGAGCCCTGCGGTGAGTGCGAAAACTGCAGGGAGATAGACA . 
GGGGTGTGTTCCCTGACTTAATTGAAATGGATGCCGCCTCAAACAGGGGT '300 
ATAGACGACGTAAGGGCATTAAAAGAAGCGGTCAATTACAAACCTATAAA 
AGGAAAGTACAAGGTTTACATAATAGACGAAGCTCACATGCTCACGAAAG 4 00 
AAGCTTTCAACGCTCTCTTAAAAACCCTCGAAGAGCCCCCTCCCAGAACT 
GTTTTCGTCCTTTGTACCACGGAGTACGACAAAATTCTTCCCACGATACT 500 
CTCAAGGTGTCAGAGGATAATCTTCTCAAAGGTAAGAAAGGAAAAAGTAA 
TAGAGTATCTAAAAAAGATATGTGAAAAGGAAGGGATTGAGTGCGAAGAG 600 
GGAGCCCTTGAGGTTCTGGCTCATGCCTCTGAAGGGTQCATGAGGGATGC 
AGCCTCTCTCCTGGACCAGGCGAGCGTTTACGGGGAAGGCAGGGTAACAA 7 00 
AAGAAGTAGTGGAGAACTTCCTCGGAATTCTCAGTCAGGAAAGCGTTAGG 
AGTTTTCTGAAATTGCTTCTGAACTCAGAAGTGGACGAAGCTATAAAGTT 8 00 
CCTCAGAGAACTCTCAGAAAAGGGCTACAACCTGACCAAGTTTTGGGAGA 
TGTTAGAAGAGGAAGTGAGAAACGCAATTTTAGTAAAGAGCCTGAAAAAT 900 
CCCGAAAGCGTGGTTCAGAACTGGCAGGATTACGAAGACTTCAAAGACTA 
CCCTCTGGAAGCCCTCCTCTACGTTGAGAACCTGATAAACAGGGGTAAAG 1000 
TTGAAGCGAGAACGAGAGAACCCTTAAGAGCCTTTGAACTCGCGGTAATA 
AAGAGCCTTATAGTCAAAGACATAATTCCCGTATCCCAGCTCGGAAGTGT 1100 
GGTAAAGGAAACCAAAAAGGAAGAAAAGAAAGTTGAAGTAAAAGAAGAGC 
CAAAAGTAAAAGAAGAAAAACCAAAGGAGCAGGAAGAGGACAGGTTCCAG 1200 
AAAGTTTTAAACGCTGTGGACGGCAAAATCCTTAAAAGAATACTTGAAGG 
GGCAAAAAGGGAAGAAAGAGACGGAAAAATCGTCCTAAAGATAGAAGCCT 1300 
C T TAT C T G AG AA C C AT G AAAAAG G AAT T T G AC T C AC T AAAG GAG AC T T T T 
CCTTTTTTAGAGTTTGAACCCGTGGAGGATAAAAAAAAACCTCAGAAGTC 14 00 
CAGCGGGACGAGGCTGTTTTAAAGGTAAAGGAGCTCTTCAATGCAAAAAT 
ACTCAAAGTACGAAGTAAAAGCTAAGGTCATAAAGGTGAGAATGCCCGTG 1500 
GAAGAGATAGGGCTGTTTAACGCACTAATAGACGGCTTGCCCAGGTACGC 
ACTCACGAGGACGAAGGAAAAGGGAAAGGGAGAAGTTTTCGTTTTAGCGA 1600 
CTCCTTATAAAGTCAAGGAATTGATGGAAGCTATGGAGGGTATGAAAAAA 
CACATAAAGGATTTAGAAATCCTCGGAGAGACGGATGAGGATTTAACTTT 17 00 
TTAAAGTATGGGTGTATCTGAGCAAAGGTTTAAGCTAAAAACAAACCTGA 
AACCCGCAGGGGACCAGCCGAAAGCCATAAAAAAACTCCTTGAAAACCTA 18 00 
AGGAAAGGCGTAAAAGAACAAACACTTCTCGGAGTCACGGGAAGCGGAAA 
GACTTTTACTCTAGCAAACGTAATAGCGAAGTACAACAAACCAACTCTTG 1900 
TGGTAGTTCACAACAAAATTCTCGCGGCACAGCTATACAGGGAGTTTAAA 
GAACTATTCCCTGAAAACGCTGTAGAGTACTTTGTCTCTTACTACGACTA 2000 
TTACCAACCTGAAGCCTACATTCCCGAAAAAGATTTATACATAGAAAAGG 
ACGCGAGTATAAACGAAAGCTGGAACGTTTCAGACACTCCGCCACGATAT 2100 
CCGTTCTAGAAAGGAGGGACGTTATAGTAGTTGCTTCAGTTTCTTGCATA 
TACGGACTCGGGAAACCTGAGCACTACGAAAACCTGAGGATAAAACTCCA 2200 
AAGGGGAATAAGACTGAACTTGAGTAAGCTCCTGAGGAAACTCGTTGAGC 
TAGGATATCAGAGAAATGACTTTGCCATAAAGAGGGCTACCTTCTCGGTT 2300 
AGGGGAGACGTGGTTGAGATAGTCCCTTCTCACACGGAAGATTACCTCGT 
GAGGGTAGAGTTCTGGGACGACGAAGTTGAAAGAATAGTCCTCATGGACG 24 00 
CTCTGAAC 
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MNYVPFARKYRPKFFREVIGQEAPVRILKNAIKNDRVAHAYLFAGPRGVG ./ - • 

KTTIARILAKALNCKNPSKGEPCGECENCREIDRGVFPDLIEMDAASNRG- 100 

IDDVRALKEAVNYKPIKGKYKVYIIDEAHMLTKEAFNALLKTLEEPPPRT 

VFVLCTTEYDKILPTILSRCQRIIFSKVRKEKVIEYLKKICEKEGIECEE 200 

GALEVLAHASEGCMRDAASLLDQASVYGEGRVTKEWENFLGILSQESVR 

SFLKLLLNSEVDEAIKFLRELSEKGYNLTKFWEMLEEEVRNAILVKSLKN 300 

PESVVQNWQDYEDFKDYPLEALLYVENLINRGKVEARTREPLRAFELAVI 

KSLI VKDI I PVSQLGS WKETKKEEKKVEVKEEPKVKEEKPKEQEEDRFQ 4 00 

KVLNAVDGKILKRILEGAKREERDGKIVLKIEASYLRTMKKEFDSLKETF 

PFLEFEPVEDKKKPQKSSGTRLF 47 3 
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atgcgcgttaaggtggacagggaggagcttgaagaggttcttaaaaaagc 
aagagaaagcacggaaaaaaaagccgcactcccgatactcgcgaacttct 100 
tactctccgcaaaagaggaaaacttaatcgtaagggcaacggacttggaa 
aactaccttgtagtctccgtaaagggggaggttgaagaggaaggagaggt 2 00- ■ 
ttgcgtccactctcaaaaactctacgatatagtcaagaacttaaattccg ■ 
cttacgtttaccttcatacggaaggtgaaaaactcgtcataacgggagga \ 300 
aagagtacgtacaaacttccgacagctcccgcggaggactttcccgaatt 
tccagaaatcgtagaaggaggagaaacactttcgggaaaccttgtcgtta 4 0 0 . 
acggaatagaaaaggtagagtacgccatagcgaaggaagaagcgaacata 
gcccttcagggaatgtatctgagaggatacgaggacagaattca'ctttgt 500 
gttcggacggtcacaggcttgcactttatgaacctctacgtaaacattga 
aaagagtgaagacgagtcttttgcttacttctccactcccgagtggaaac 600. 
tcgccgttagctcctggaaggagaattcccggactagatgagtgtcatcc ' • : v 
ctgaggagttttcggcggaagtcttgtttgagacagaggaagtcttaaag 7 0.0 
gttttaaagaggttgaaggctttaagcgaaggaaaagtttttcccgtgaa 
gattaccttaagcgaaaaccttgccatctttgagttcgcggatccggagt 8 00*/ 
tcggagaagcgagagaggaaattgaagtggagtacacgggagagcccttt 
gagataggattcaacggaaataccttatggaggcgcttgacgcctacgac 900 
agcgaaagagtgtggttcaagttcacaacccccgacacggccactttatt 
ggaggctgaagattacgaaaaggaaccttacaagtgcataataatgccga 1000 
tgagggtgtagccatgaa7vaaagctttaatctttttattgagcttgagcc 
ttttaattcctgcgtttagcgaagccaaacccaagtcttc 10 90 
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MRVKVDREELEEVLKKARESTEKKAALPILANFLLSAKEENLIVRATDLE 
NYLVVSVKGEVEEEGEVCVHSQKLYDIVKNLNSAYVYLHTEGEKLVITGG 100 
KSTYKLPTAPAEDFPEFPEIVEGGETLSGNLLVNGIEKVEYAIAKEEANI 
ALQGMYLRGYEDRIHFVGSDGHRLALYEPLGEFSKELLI PRKSLKVLKKL 2 00 
ITGIEDVNIEKSEDESFAYFSTPEWKLAVRLLEGEFPDYMSVIPEEFSAE 
VLFETEEVLKVLKRLKALSEGKVFPVKITLSENLAIFEFADPEFGEAREE 300 
IEVEYTGEPFEIGFNGKYLMEALDAYDSERVWFKFTTPDTATLLEAEDYE 
KEPYKCIIMPMRV 363 
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GTGGAAACCACAATATTCCAGTTCCAGAAAACTTTTTTCACAAAACCTCC 
GAAGGAGAGGGTCTTCGTCCTTCATGGAGAAGAGCAGTATCTCATAAGAA 100 
CCTTTTTGTCTAAGCTGAAGGAAAAGTACGGGGAGAATTACACGGTTCTG 
TGGGGGGATGAGATAAGCGAGGAGGAATTCTACACTGCCCTTTCCGAGAC 200 
CAGTATATTCGGCGGTTCAAAGGAAAAAGCGGTGGTCATTTACAACTTCG 
GGGATTTCCTGAAGAAGCTCGGAAGGAAGAAAAAGGAAAAAGAAAGGCTT 300 
ATAAAAGTCCTCAGAAACGTAAAGAGTAACTACGTATTTATAGTGTACGA 
TGCGAAACTCCAGAAACAGGAACTTTCTTCGGAACCTCTGAAATCCGTAG 4 00 
CGTCTTTCGGCGGTATAGTGGTAGCAAACAGGCTGAGCAAGGAGAGGATA . 
AAAC AG CTCGTCCT T AA G AA G T T C AAAG AAAAA G G G AT AAAC GT AG AAAA 500 
CGATGCCCTTGAATACCTTCTCCAGCTCACGGGTTACAACTTGATGGAGC 
TCAAACTTGAGGTTGAAAAACTGATAGATTACGCAAGTGAAAAGAAAATT 600 
TTAACACTCGATGAGGTAAAGAGAGTAGCCTTCTCAGTCTCAGAAAACGT 
AAACGTATTTGAGTTCGTTGATTTACTCCTCTTAAAAGATTACGAAAAGG . 7 00 
CTCTTAAAGTTTTGGACTCCCTCATTTCCTTCGGAATACACCCCCTCCAG 
ATTATGAAAATCCTGTCCTCCTATGCTCTAAAACTTTACACCCTCAAGAG 8 00 
GCTTGAAGAGAAGGGAGAGGACCTGAATAAGGCGATGGAAAGCGTGGGAA 
T AAAG AAC AAC T T T C T C AAG AT G AA G T T C AAAT C T T AC T T AAAG G C AAAC 900 
TC T AAAG AG G AC T T G AAG AAC C T AAT CCTCTCCCTC C AG AGG AT AG AC GC 
TTTTTCTAAACTTTACTTTCAGGACACAGTGCAGTTGCTGGGGATTTCTT 1000 
GACCTCAAGACTGGAGAGGGAAGTTGTGAAAAATACTTCTCATGGTGGAT 
AATCTTTTTTATGAAGTTTGCGGTTTGCGTTTTTCCCGGTTCT 1093 
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VETTIFQFQKTFFTKPPKERVFVLHGEEQYLIRTFLSKLKEKYGENYTVL 
WGDEISEEEFYTALSETSIFGGSKEKAVVIYNFGDFLKKLGRKKKEKERL 100 
IKVLRNVKSNYVFIVYDAKLQKQELSSEPLKSVASFGGIWANRLSKERI 
KQLVLKKFKEKGINVENDALEYLLQLTGYNLMELKLEVEKLIDYASEKKI 2 00 
LTLDEVKRVAFSVSENVNVFEFVDLLLLKDYEKALKVLDSLISFGIHPLQ 
IMKILSSYALKLYTLKRLEEKGEDLNKAMESVGIKNNFLKMKFKSYLKAN 300 
SKEDLKNLILSLQRIDAFSKLYFQDTVQLLRDFLTSRLEREWKNTSHGG 
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ATGGAAAAAGTTTTTTTGGAAAAACTCCAGAAAACCTTGCACATACCCGG 
AGGACTCCTTTTTTACGGCAAAGAAGGAAGCGGAAAGACGAAAACAGCTT 100 
TTGAATTTGCAAAAGGTATTTTATGTAAGGAAAACGTACCTGGGGATGCG * 1 
GAAGTTGTCCCTCCTGCAAACACGTAAACGAGCTGGAGGAAGCCTT-CTTT 200 • 
AAAG G AG AAAT AG AAG AC T T T AAAG T T T AT AAG AC AAG G AC G G T AAAAAG " 
CACTTCGTTTACCTTATGGGCGAACATCCCGACTTTGTGGTAATAATCCC 300 
GAG C G G AC AT T AC AT AAAG AT AG AAC AG AT AAG GG AAGT T AAG AAC T T T G 
CCTATGTGAAGCCCGCACTAAGCAGGAGAAAAGTAATTATAATAGACGAC 4 00 
GCCCACGCGATGACCTCTCAGGCGGCAAACGCTCTTTTAAAGGTATTGGA 
AGAGCCACCTGCGGACACCACCTTTATCTTGACCACGAACAGGCGTTCTG 500 
CAATCCTGCCGACTATCCTCTCCAGAACTTTTCAAGTGGAGTTCAAGGGC 
TTTTCAGTAAAAGAGGTTATGGAAATAGCGAAAGTAGACGAGGAAATAGC 600 
GAAACTCTCTGGAGGCAGTCTAAAAAGGGCTATCTTACTAAAGGAAAACA 
AAGATATCCTAAACAAAGTAAAGGAATTCTTGGAAAACGAGCCGTTAAAA 7 00 
GTTTACAAGCTTGCAAGTGAATTCGAAAAGTGGGAACCTGAAAAGCAAAA 
ACTCTTCCTTGAAATTATGGAAGAATTGGTATCTCAAAAATTGACCGAAG 8 00 
AG AAAAAAG AC AAT T AC AC C T AC C T T C T T GAT AC GAT C AG AC T C T T T AAA 
GACGGACTCGCAAGGGGTGTAAACGAACCTCTGTGGCTGTTTACGTTAGC 900 
CGTTCAGGCGGATTAATAAACCGTTATTGATTCCGTAACATTTAAACCTT 
AATCTAAATTATGAGAGCCTTTGAAGGAGGTCTGGTATGGAAAATTTGAA 1000 
GATTAGATATATAGATACGAGGAAGATAGGAACCGTGAGCGGTGTAAAAG 
T 1051 
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MEKVFLEKLQKTLHIPGGLLFYGKEGSGKTKTAFEFAKGILCKENVPWGC 
GSCPSCKHVNELEEAFFKGEIEDFKVYKDKDGKKHFVYLMGEHPDFWII 100 
PSGHYIKIEQIREVKNFAYVKPALSRRKVIIIDDAHAMTSQAANALLKVL 
EEPPADTTFILTTNRRSAILPTILSRT FQVEFKGFSVKEVMEIAKVDEEI 200 
AKLSGGSLKRAILLKENKDILNKVKEFLENEPLKVYKLASEFEKWEPEKQ 
KLFLEIMEELVSQKLTEEKKDNYTYLLDTIRLFKDGLARGVNEPLWLFTL 300 
AVQAD 
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ATGAACTTCCTGAAAAAGTTCCTTTTACTGAGAAAAGCTCAAAAGTCTCC 
TTACTTCGAAGAGTTCTACGAAGAAATCGATTTGAACCAGAAGGTGAAAG 
ATGCAAGGTTTGTAGTTTTTGACTGCGAAGCCACAGAACTCGACGTAAAG 
AAGGCAAAACTCCTTTCAATAGGTGCGGTTGAGGTTAAAAACCTGGAAAT 
A G AC C T C T C T AAAT C T T T T T AC GAG AT AC T C AAAAG T GAG G AG AT AAAG G 
CGGCGGAGATACATGGAATAACCAGGGAAGACGTTGAAAAGTACGGAAAG 
GAACCAAAGGAAGTAATATACGACTTTCTGAAGTACATAAAGGGAAGCGT 
TCTCGTTGGCTACTACGTGAAGTTTGACGTCTCACTCGTTGAGAAGTACT 
CCATAAAGTACTTCCAGTATCCAATCATCAACTACAAGTTAGACCTGTTT 
AGTTTCGTGAAGAGAGAGTACCAGAGTGGCAGGAGTCTTGACGACCTTAT 
GAAGGAACTCGGTGTAGAAATAAGGGCAAGGCACAACGCCCTTGAAGATG 
CCTACATAACCGCTCTTCTTTTCCTAAAGTACGT^TACCCGAACAGGGAG 
TACAGACTAAAGGATCTCCCGATTTTCCTT I 
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MNFLKKFLLLRKAQKSPYFEEFYEEIDLNQKVKDARFWFDCEATELDVK 
KAKLLSIGAVEVKNLEIDLSKSFYEILKSDEIKAAEIHGITREDVEKYGK 
EPKEVIYDFLKYIKGSVLVGYYVKFDVSLVEKYSIKYFQYPIINYKLDLF 
SFVKREYQSGRSLDDLMKELGVEIRARHNALEDAYITALLFLKYVYPNRE 
YRLKDLPIFL 
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ATGCTCAATAAGGTTTTTATAATAGGAAGACTTACGGGTGACCCCGTTAT 

AACTTATCTACCGAGCGGAACGCCCGTAGTAGAGTTTACTCTGGCTTACA 100 
ACAGAAGGTATAAAAACCAGAACGGTGAATTTCAGGAGGAAAGTCACTTC 

TTTGACGTAAAGGCGTACGGAAAAATGGCTGAAGACTGGGCTACAGGCTT '2 00. ; 

CTCGAAAGGATACCTCGTACTCGTAGAGGGAAGACTCTC.CCAGGAAAAGT ■ " - 

GGGAGAAAGAAGGAAAGAAGTTCTCAAAGGTCAGGATAATAGCGGAAAAC 300 
GTAAGATTAATAAACAGGCCGAAAGGTGCTGAACTTCAAGCAGAAGAAGA 

GGAGGAAGTTCCTCCCATTGAGGAGGAAATTGAAAAACTCGGTAAAGAGG .4 00 
AAGAGAAGCCTTTTACCGATGAAGAGGACGAAATACCTTTTTAATTTTGA 

GGAGGTTAAAGTATGGTAGTGAGAGCTCCTAAGAAGAAAGTTTGTATGTA 500 
C T GT G AAC AAAAG AG AG AG C C AG AT T \." 
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MLNKVFIIGRLTGDPVITYLPSGTPWEFTLAYNRRYKNQNGEFQEESHF 
FDVKAYGKMAEDWATRFSKGYLVLVEGRLSQEKWEKEGKKFSKVRIIAEN 100 
VRLINRPKGAELQAEEEEEVPPIEEEIEKLGKEEEKPFTDEEDEIPF 
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ATGCAATTTGTGGATAAACTTCCCTGTGACGAATCCGCCGAGAGGGCGGT 

TCTTGGCAGTATGCTTGAAGACCCCGAAAACATACCTCTGGTACTTGAAT 100 

ACCTTAAAGAAGAAGACTTCTGCATAGACGAGCACAAGCTACTTTTCAGG 

GTTCTTACAAACCTCTGGTCCGAGTACGGCAATAAGCTCGATTTCGTATT 200 

AAT AAAG GAT C AC C T T G AAAAG AAAAAC T T AC T C C AG AAAAT AC C TAT AG 

ACTGGCTCGAAGAACTCTACGAGGAGGCGGTATCCCCTGACACGCTTGAG 300 

GAAGTCTGCAAAATAGTAAAACAACGTTCCGCACAGAGGGCGATAATTCA 

AC T C GGT AT AG AAC T C AT T C AC AAAGG AAAG G AAAAC AAAG AC T T T C AC A 4 00 

CATTAATCGAGGAAGCCCAGAGCAGGATATTTTCCATAGCGGAAAGTGCT 

ACATCTACGCAGTTTTACCATGTGAAAGACGTTGCGGAAGAAGTTATAGA 500 

ACTCATTTATAAATTCAAAAGCTCTGACAGGCTAGTCACGGGACTCCCAA 

GCGGTTTCACGGAACTCGATCTAAAGACGACGGGATTCCACCCTGGAGAC 600 

TTAATAATACTCGCCGCAAGACCCGGTATGGGGAAAACCGCCTTTATGCT 

CTCCATAATCTACAATCTCGCAAAAGACGAGGGAAAACCCTCAGCTGTAT 7 00, 

TTTCCTTGGAAATGAGCAAGGAACAGCTCGTTATGAGACTCCTCTCTATG 

ATGTCGGAGGTCCCACTTTTCAAGATAAGGTCTGGAAGTATATCGAATGA 8 00 

AGATTTAAAGAAGCTTGAAGCAAGCGCAATAGAACTCGCAAAGTACGACA 

TATACCTCGACGACACACCCGCTCTCACTACAACGGATTTAAGGATAAGG 900 

GCAAGAAAGCTCAGAAAGGAAAAGGAAGTTGAGTTCGTGGCGGTGGACTA 

CTTGCAACTTCTGAGACCGCCAGTCCGAAAGAGTTCAAGACAGGAGGAAG 1000 

TGGCAGAGGTTTCAAGAAACTTAAAAGCCCTTGCAAAGGAACTTCACATT 

CCCGTTATGGCACTTGCGCAGCTCTCCCGTGAGGTGGAAAAGAGGAGTGA 1100 

T AAAAG AC C C C AG C T T G C G G AC C T C AG AG AAT C C GG AC AG AT AG AAC AGG 

ACGCAGACCTAATCCTTTTCCTCCACAGACCCGAGTACTACAAGAAAAAG 1200 

CCAAATCCCGAAGAGCAGGGTATAGCGGAAGTGATAATAGCCAAGCAAAG 

GCAAGGACCCACGGACATTGTGAAGCTCGCATTTATTAAGGAGTACACTA 1300 

AGTTTGCAAACCTAGAAGCCCTTCCTGAACAACCTCCTGAAGAAGAGGAA 

C T T T C C G AAAT T AT T G AAAC AC AG GAG GAT G AAG GAT T C G AAG AT AT T G A 14 00 

CTTCTGAAAATTAAGGTTTTATAATTTTATCTTGGCTATCCGGGGTAGCT 

CAATCGGCAGAGCGGGTGGCTG 14 72 
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MQFVDKLPCDESAERAVLGSMLEDPENIPLVLEYLKEEDFCIDEHKLLFR 
VLTNLWSEYGNKLDFVLIKDHLEKKNLLQKIPIDWLEELYEEAVSPDTLE 100 
EVCKIVKQRSAQRAI IQLGITSTQFYHVKDVAEEVIELIYKFKSSDRLVT 
GLPSGFTELDLKTTGFHPGDLI ILAARPGMGKTAFMLS I I YNLAKDEGKP 200 
SAVFSLEMSKEQLVMRLLSMMSEVPLFKIRSGSISNEDLKKLEASAIELA 
KYDIYLDDTPALTTTDLRIRARKLRKEKEVEFVAVDYLQLLRPPVRKSSR 300 
QEEVAEVSRNLKALAKELHIPVMALAQLSREVEKRSDKRPQLADLRESGQ 
IEQDADLILFLHRPEYYKKKPNPEEQGIAEVIIAKQRQGPTDIVKLAFIK 4 00 
EYTKFANLEALPEQPPEEEELSEIIETQEDEGFEDIDF 
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ATGTCCTCGGACATAGACGAACTTAGACGGGAAATAGATATAGTAGACGT 

CATTTCCGAATACTTAAACTTAGAGAAGGTAGGTTCCAATTACAGAACGA >v l 0 0 

ACTGTCCCTTTCACCCTGACGATACACCCTCCTTTTACGTGTCTCCAAGT. 

AAACAAATATTCAAGTGTTTCGGTTGCGGGGTAGGGGGAGACGCGATAAA. - 200 . 

GTTCGTTTCCCTTTACGAGGACATCTCCTATTTTGAAGCCGCCCTTGAAC 

T C GC AAAAC G C T AC GG AAAG AAAT TAG AC C T T G AAAAG AT AT C AAAAG AC 300 

GAAAAGGTATACGTGGCTCTTGACAGGGTTTGTGATTTCTACAGGGAAAG 

CCTTCTCAAAAACAGAGAGGCAAGTGAGTACGTAAAGAGTAGGGGAATAG 4 0 0 

ACCCTAAAGTAGCGAGGAAGTTTGATCTTGGGTACGCACCTTCCAGTGAA 

GCACTCGTAAAAGTCTTAAAAGAGAACGATCTTTTAGAGGCTTACCTTGA 500 

AACTAAAAACCTCCTTTCTCCTACGAAGGGTGTTTACAGGGATCTCTTTC 

TTCGGCGTGTCGTGATCCCGATAAAGGATCCGAGGGGAAGAGTTATAGGT 600 * 

TTCGGTGGAAGGAGGATAGTAGAGGACAAATCTCCCAAGTACATAAACTC 

TCCAGACAGCAGGGTATTTAAAAAGGGGGAGAACTTATTCGGTCTTTACG 7 00 

AGGCAAAGGAGTATATAAAGGAAGAAGGATTTGCGATACTTGTGGAAGGG 

TACTTTGACCTTTTGAGACTTTTTTCCGAGGGAATAAGGAACGTTGTTGC 800 

ACCCCTCGGTACAGCCCTGACCCAAAATCAGGCAAACCTCCTTTCCAAGT 

TCACAAAAAAGGTCTACATCCTTTACGACGGAGATGATGCGGGAAGAAAG 900 

GCTATGAAAAGTGCCATTCCCCTACTCCTCAGTGCAGGAGTGGAAGTTTA 

TCCCGTTTACCTCCCCGAAGGATACGATCCCGACGAGTTTATAAAGGAAT 1000 

TCGGGAAAGAGGAATTAAGAAGACTGATAAACAGCTCAGGGGAGCTCTTT 

GAAACGCTCATAAAAACCGCAAGGGAAAACTTAGAGGAGAAAACGCGTGA 1100 

GTTCAGGTATTATCTGGGCTTTATTTCCGATGGAGTAAGGCGCTTTGCTC 

TGGCTTCGGAGTTTCACACCAAGTACAAAGTTCCTATGGAAATTTTATTA 1200 

ATGAAAATTGAAAAAAATTCTCAAGAAAAAGAAATTAAACTCTCCTTTAA 

G G AAAAAAT C T T CC T G AAAG G AC T G AT AG AAT T AAAAC C AAAAAT AG AC C 1300 

TTGAAGTCCTGAACTTAAGTCCTGAGTTAAAGGAACTCGCAGTTAACGCC 

TTAAACGGAGAGGAGCATTTACTTCCAAAAGAAGTTCTCGAGTACCAGGT 14 00 

GGATAACTTGGAGAAACTTTTTAACAACATCCTTAGGGATTTACAAAAAT 

CTGGGAAAAAGAGGAAGAAAAGAGGGTTGAAAAATGTAAATACTTAATTA 1500 

ACTTTAATAAATTTTTAGAGTTAGGA 
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MSSDIDELRREIDIVDVISEYLNLEKVGSNYRTNCPFHPDDTPSFYVSPS 

KQIFKCFGCGVGGDAIKFVSLYEDISYFEAALELAKRYGKKLDLEKISKD 100 
EKVYVALDRVCDFYRESLLKNREASEYVKSRGIDPKVARKFDLGYAPSSE 

ALVKVLKENDLLEAYLETKNLLSPTKGVYRDLFLRRWIPIKDPRGRVIG 2 00 
FGGRRIVEDKSPKYINSPDSRVFKKGENLFGLYEAKEYIKEEGFAILVEG 

YFDLLRLFSEGIRNWAPLGTALTQNQANLLSKFTKKVYILYDGDDAGRK 300 
AMKSAIPLLLSAGVEVYPVYLPEGYDPDEFIKEFGKEELRRLINSSGELF 

ETLIKTARENLEEKTREFRYYLGFISDGVRRFALASEFHTKYKVPMEILL 4 00 
MKIEKNSQEKEIKLSFKEKIFLKGLIELKPKIDLEVLNLSPELKELAVNA 

LNGEEHLLPKEVLEYQVDNLEKLFNNILRDLQKSGKKRKKRGLKNVNT 4 98 
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ATGCAAGATACCGCTACCTGCAGTATTTGTCAGGGGACGGGATTCGTAAA 
GACCGAAGACAACAAGGTAAGGCTCTGCGAATGCAGGTTCAAGAAAAGGG 
ATGTAAACAGGGAACTAAACATCCCAAAGAGGTACTGGAACGCCAACTTA 
GACACTTACCACCCCAAGAACGTATCCCAGAACAGGGCACT.TTTGACGAT 
AAGGGTCTTCGTCCACAACTTCAATCCCGAGGAAGGGAAAGGGCTTACCT, 
TTGTAGGATCTCCTGGAGTCGGCAAAACTCACCTTGCGGTTGCAACATTA 
AAAGCGATTTATGAGAAGAAGGGAATCAGAGGATACTTCTTCGATACGAA 
GGATCTAATATTCAGGTTAAAACACTTAATGGACGAGGGAAAGGATACAA 
AGTTTTTAAAAACTGTCTTAAACTCACCGGTTTTGGTTCTCGACGACCTC 
GGTTCTGAGAGGCTCAGTGACTGGCAGAGGGAACTCATCTCTTACATAAT 
CACTTACAGGTATAACAACCTTAAGAGCACGATAATAACCACGAATTACT 
CACTCCAGAGGGAAGAAGAGAGTAGCGTGAGGATAAGTGCGGATCTTGCA 
AGCAGACTCGGAGAAAACGTAGTTTCAAAAATTTACGAGATGAACGAGTT 
GCTCGTTATAAAGGGTTCCGACCTCAGGAAGTCTAAAAAGCTATCAACCC 
CATCT 
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MQDTATCSICQGTGFVKTEDNKVRLCECRFKKRDVNRELNIPKRYWNANL 
DTYHPKNVSQNRALLTIRVFVHNFNPEEGKGLTFVGSPGVGKTHLAVATL 
KAIYEKKGIRGYFFDTKDLIFRLKHLMDEGKDTKFLKTVLNSPVLVLDDL 
GSERLSDWQRELISYIITYRYNNLKSTIITTNYSLQREEESSVRISADLA 
SRLGENVVSKIYEMNELLVIKGSDLRKSKKLSTPS 
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ATGAAAAAGATTGAAAATTTGAAGTGGAAAAATGTCTCGTTTAAAAGCCT 
GGAAATAGATCCCGATGCAGGTGTGGTTCTCGTTTCCGTGSAAAAATTCT/ * 100 
CCGAAGAGATAGAAGACCTTGTGCGTTTACTGGAGAAGAAGACGCGGTTT . - 
CGAGTCATCGTGAACGGTGTTCAAAAAAGTAACGGGGATCTAAGGGGAAA 200 
GATACTTTCCCTTCTCAACGGTAATGTGCCTTACATAAAAGATGTTGTTT 
TCGAAGGAAACAGGCTGAT.TCTGAAAGTGCTTGGAGATTTCGCGCGGGAC 300 
AGGATCGCCTCCAAACTCAGAAGCACGAAAAAACAGCTCGATGAACTGCT 
GCCTCCCGGAACAGAGATCATGCTGGAGGTTGTGGAGCCTCCGGAAGATC 4 00 
TTTTGAAAAAGGAAGTACCACAACCAGAAAAGAGAGAAGAACCAAAGGGT 
G AAG AAT T G AAG AT C GAG GAT G AAAAC C AC AT C T T T G G AC AG AAAC C C AG 500 
AAAGATCGTCTTCACCCCCTCAAAAATCTTTGAGTACAACAAAAAGACAT 
CGGTGAAGGGCAAGATCTTCAAAATAGAGAAGATCGAGGGGAAAAGAACG 600 
GTCCTTCTGATTTACCTGACAGACGGAGAAGATTCTCTGATCTGCAAAGT 
CTTCAACGACGTTGAAAAGGTCGAAGGGAAAGTATCGGTGGGAGACGTGA 700 
TCGTTGCCACAGGAGACCTCCTTCTCGAAAACGGGGAGCCCACCCTTTAC 
GTGAAGGGAATCACAAAACTTCCCGAAGCGAAAAGGATGGACAAATCTCC 800 
GGTTAAGAGGGTGGAGCTCCACGCCCATACCAAGTTCAGCGATCAGGACG 
CAATAACAGATGTGAACGAATATGTGAAACGAGCCAAGGAATGGGGCTTT 900 
CCCGCGATAGCCCTCACGGATCATGGGAACGTTCAGGCCATACCTTACTT 
CTACGACGCGGCGAAAGAAGCTGGAATAAAGCCCATTTTCGGTATCGAAG 1000 
CGTATCTGGTGAGTGACGTGGAGCCCGTCATAAGGAATCTCTCCGACGAT 
TCGACGTTTGGAGATGCCACGTTCGTCGTCCTCGACTTCGAGACGACGGG 1100 
TCTCGACCCGCAGGTGGATGAGATCATCGAGATAGGAGCGGTGAAGATAC 
AGGGTGGCCAGATAGTGGACGAGTACCACACTCTCATAAAGCCTTCCAGG 1200 
GAGATCTCAAGAAAAAGTTCGGAGATCACCGGAATCACTCAAGAGATGCT 
GGAAAACAAGAGAAGCATCGAGGAAGTTCTGCCGGAGTTCCTCGGTTTTC 1300. 
TGGAAGATTCCATCATCGTAGCACACAACGCCAACTTCGACTACAGATTT 
CTGAGGCT GTGGATCAAAAAAGTGATGGGATTGGACTGGGAAAGACCCTA 14 00 
CATAGATACGCTCGCCCTCGCAAAGTCCCTTCTCAAACTGAGAAGCTACT 
CTCTGGATTCCGTTGTGGAAAAGCTCGGATTGGGTCCCTTCCGGCACCAC - 1500 
AGGGCCCTGGATGACGCGAGGGTCACCGCTCAGGTTTTCCTCAGGTTCGT ' 
TGAGATGATGAAGAAGATCGGTATCACGAAGCTTTCAGAAATGGAGAAGT 1600 
TGAAGGATACGATAGACTACACCGCGTTGAAACCCTTCCACTGCACGATC 
CTCGTTCAGAACAAAAAGGGATTGAAAAACCTATACAAACTGGTTTCTGA 17 00 
TTCCTATATAAAGTACTTCTACGGTGTTCCGAGGATCCTCAAAAGTGAGC 
TCATCGAGAACAGAGAAGGACTGCTCGTGGGTAGCGCGTGTATCTCCGGT 1800 
GAGCTCGGACGTGCCGCCCTCGAAGGAGCGAGTGATTCAGAACTCGAAGA 
GATCGCGAAGTTCTACGACTACATAGAAGTCATGCCGCTCGACGTTATAG 1900 
C C G AAG AT G AAG AAG AC C T A G A C AG AG AAAG AC T G AAAG AAG T G T AC C G A 
AAACTCTACAGAATAGCGAAAAAATTGAACAAGTTCGTCGTCATGACCGG 2000 
TGATGTTCATTTCCTCGATCCCGAAGATGCCAGGGGCAGAGCTGCACTTC 
TGGCACCTCAGGGAAACAGAAACTTCGAGAATCAGCCCGCACTCTACCTC 2100 
AG AAC G AC C G AAG AAAT G C T C GAG AAG G C GAT AG AG AT AT T C G AAG AT G A 
AGAGATCGCGAGGGAAGTCGTGATAGAGAATCCCAACAGAATAGCCGATA 2200 
TGATCGAGGAAGTGCAGCCGCTCGAGAAAAAACTTCACCCGCCGATCATA 
GAGAACGCCGATGAAATAGTGAGAAACCTCACCATGAAGCGGGCGTACGA 2300 
GATCTACGGTGATCCGCTTCCCGAAATCGTCCAGAAGCGTGTGGAAAAGG 
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aactgaacgccatcataaatcatggatacgccgttctctatctcatcgct ' 2*4 00 
caggagctcgttcagaaatctatgagcgatggttacgtggttggatccag 

aggatccgtcgggtcttcactcgtggccaatctcctcggaataacagagg 2500 

tgaatcccctaccaccacattacaggtgtccagagtgcaaatactttgaa ' 

gttgtcgaagacgacagatacggagcgggttacgaccttcccaacaagaa 2 600 
ctgtccaagatgtggggctcctctcagaaaagacggccacggcataccgt 

ttgaaacgttcatggggttcgagggtgacaaggtccccgacatagatctc 2 7 00 
aacttctcaggagagtatcaggaacgtg'ctcatcgttttgtggaagaact 

cttcggtaaagaccacgtctatagggcgggaaccataaacaccatcgcgg 2 8 00 
aaagaagtgcggtgggttacgtgagaagctacgaagagaaaaccggaaag 

aagctcagaaaggcggaaatggaaagactcgtttccatgatcacgggagt 2 900 
gaagagaacgacgggtcagcacccaggggggctcatgatcataccgaaag 

acaaagaagtctacgatttcactcccatacagtatccagccaacgataga 3000 
aacgcaggtgtgttcaccacgcacttcgcatacgagacgatccatgatga 

cctggtgaagatagatgcgctcggccacgatgatcccactttcatcaaga 3100 
tgctcaaggacctcaccggaatcgatcccatgacgattcccatggatgac 

cccgatacgctcgccatattcagttctgtgaagcctcttggtgtggatcc 3200 
cgttgagctggaaagcgatgtgggaacgtacggaattccggagttcggaa 

ccgagtttgtgaggggaatgctcgttgaaacgagaccaaagagtttcgcc 3300 
gagcttgtgagaatctcaggactgtcacacggtacggacgtctggttgaa 

caacgcacgtgattggataaacctcggctacgccaagctctccgaggtta 34 00 
tctcgtgtagggacgacatcatgaacttcctcatacacaaaggaatggaa 

ccgtcacttgccttcaagatcatggaaaacgtcaggaagggaaagggtat 3500. 
cacagaagagatggagagcgagatgagaaggctgaaggttccagaatggt 

tcatcgaatcctgtaaaaggatcaaatatctcttcccgaaagctcacgct 3 600 
gtggcttacgtgagtatggccttcagaattgcttacttcaaggttcacta 

tcctcttcagttttacgcggcgtacttcacgataaaaggtgatcagttcg 37 00 
atccggttctcgtactcaggggaaaagaagccataaagaggcgcttgaga 

gaactcaaagcgatgcctgccaaagacgcccagaagaaaaacgaagtgag 3800 
tgttctggaggttgccctggaaatgatactgagaggtttttccttcctac 

cgcccgacatcttcaaatccgacgcgaagaaatttctgatagaaggaaac 3 900 
tcgctgagaattccgttcaacaaacttccaggactgggtgacagcgttgc 

cgagtcgataatcagagccagggaagaaaagccgttcacttcggtggaag 4 000 

AT C T CAT G AAG AG G AC C AAG G T C AAC AAAAAT C AC AT AG AG C T GAT GAAA 

AGCCTGGGTGTTCTCGGGGACCTTCCAGAGACGGAACAGTTCACGCTTTT 4100 
C 
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MKKIENLKWKNVSFKSLEIDPDAGVVLVSVEKFSEEIEDLVRLLEKKTRF 
RVIVNGVQKSNGDLRGKILSLLNGNVPYIKDVVFEGNRLILKVLGDFARD 100 
RIASKLRSTKKQLDELLPPGTEIMLEWEPPEDLLKKEVPQPEKREEPKG 
EELKIEDENHIFGQKPRKIVFTPSKIFEYNKKTSVKGKIFKIEKIEGKRT '. 2 00 
VLLIYLTDGEDSLICKVFNDVEKVEGKVSVGDVIVATGDLLLENGEPTLY 
VKGITKLPEAKRMDKSPVKRVELHAHTKFSDQDAITDVNEYVKRAKEWGF 300 
PAIALTDHGNVQAIPYFYDAAKEAGIKPIFGIEAYLVSDVEPVIRNLSDD 
STFGDATFWLDFETTGLDPQVDEIIEIGAVKIQGGQIVDEYHTLIKPSR 4 00 
EISRKSSEITGITQEMLENKRSIEEVLPEFLGFLEDSIIVAHNANFDYRF 
LRLWIKKVMGLDWERPYIDTLALAKSLLKLRSYSLDSVVEKLGLGPFRHH 500 
RALDDARVTAQVFLRFVEMMKKIGITKLSEMEKLKDTIDYTALKPFHCTI 
LVQNKKGLKNLYKLVSDSYIKYFYGVPRILKSELIENREGLLVGSACISG 600 
ELGRAALEGASDSELEEIAKFYDYIEVMPLDVIAEDEEDLDRERLKEVYR 
KLYRIAKKLNKFWMTGDVHFLDPEDARGRAALLAPQGNRNFENQPALYL 7 00 
RTTEEMLEKAIEIFEDEEIAREWIENPNRIADMIEEVQPLEKKLHPPII 
ENADEIVRNLTMKRAYEIYGDPLPEIVQKRVEKELNAIINHGYAVLYLIA 800 
QELVQKSMSDGYWGSRGSVGSSLVANLLGITEVNPLPPHYRCPECKYFE 
WEDDRYGAGYDLPNKNCPRCGAPLRKDGHGIPFETFMGFEGDKVPDIDL 900 
NFSGEYQERAHRFVEELFGKDHVYRAGTINTIAERSAVGYVRSYEEKTGK 
KLRKAEMERLVSMITGVKRTTGQHPGGLMIIPKDKEVYDFTPIQYPANDR 1000 
NAGVFTTHFAYETIHDDLVKIDALGHDDPTFIKMLKDLTGIDPMTIPMDD 
PDTLAIFSSVKPLGVDPVELESDVGTYGIPEFGTEFVRGMLVETRPKSFA 1100 
ELVRISGLSHGTDVWLNNARDWINLGYAKLSEVISCRDDIMNFLIHKGME 
PSLAFKIMENVRKGKGITEEMESEMRRLKVPEWFIESCKRIKYLFPKAHA 12 00 
VAYVSMAFRIAYFKVHYPLQFYAAYFTIKGDQFDPVLVLRGKEAIKRRLR 
ELKAMPAKDAQKKNEVSVLEVALEMILRGFSFLPPDIFKSDAKKFLIEGN 1300 
SLRIPFNKLPGLGDSVAESIIRAREEKPFTSVEDLMKRTKVNKNHIELMK 
SLGVLGDLPETEQFTLF 1367 
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GTGCTCGCCATGATATGGAACGACACCGTTTTTTGCGTCGTAGACACAGA 
AACCACGGGAACCGATCCCTTTGCCGGAGACCGGATAGTTGAAATAGCCG 
CTGTTCCTGTCTTCAAGGGGAAGATCTACAGAAACAAAGCGTTTCACTCT 
CTCGTGAATCCCAGAATAAGAATCCCTGCGCTGATTCAGAAAGTTCACGG 
TATCAGCAACATGGACATCGTGGAAGCGCCAGACATGGACACAGTTTACG 
ATCTTTTCAGGGATTACGTGAAGGGAACGGTGCTCGTGTTTCACAACGCC 
AACTTCGACCTCACTTTTCTGGATATGATGGCAAAGGAAACGGGAAACTT 
TCCAATAACGAATCCCTACATCGACACACTCGATCTTTCAGAAGAGATCT 
TTGGAAGGCCTCATTCTCTCAAATGGCTCTCCGAAAGACTTGGAATAAAA 
ACCACGATACGGCACCGTGCTCTTCCAGATGCCCTGGTGACCGCAAGAGT 
TTTTGTGAAGCTTGTTGAATTTCTTGGTGAAAACAGGGTCAACGAATTCA 
TACGTGGAAAACGGGGG 
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MLAMIWNDTVFCVVDTETTGTDPFAGDRIVEIAAVPVFKGKIYRNKAFHS 
LVNPRIRIPALIQKVHGISNMDIVEAPDMDTVYDLFRDYVKGTVLVFHNA 
NFDLTFLDMMAKETGNFPITNPYIDTLDLSEEIFGRPHSLKWLSERLGIK 
TTIRHRALPDALVTARVFVKLVEFLGENRVNEFIRGKRG 
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GTGGAAGTTCTTTACAGGAAGTACAGGCCAAAGACTTTTTCTGAGGTTGT 

CAATCAGGATCATGTGAAGAAGGCAATAATCGGTGCTATTCAGAAGAACA 100 

GCGTGGCCCACGGATACATATTCGCCGGTCCGAGGGGAACGGGGAAGACT . 

ACTCTTGCCAGAATTCTCGCAAAATCCCTGAACTGTGAGAACAGAAAGGG " 200 

AGTTGAACCCTGCAATTCCTGCAGAGCCTGCAGAGAGATAGACGAGGGAA 

CCTTCATGGACGTGATAGAGCTCGACGCGGCCTCCAACAGAGGAATAGAC 300 

GAGATCAGAAGAATCAGAGACGCCGTTGGATACAGGCCGATGGAAGG.TAA 

ATACAAAGTCTACATAATAGACGAAGTTCACATGCTCACGAAAGAAGCCT 4 00 

TCAACGCGCTCCTCAAAACACTCGAAGAACCTCCTTCCCACGTCGTGTTC 

GTGCTGGCAACGACAAACCTTGAGAAGGTT-CCTCCCACGATT ATCTCGAG 500 

ATGTCAGGTTTTCGAGTTCAGAAACATTCCCGACGAGCTCATCGAAAAGA 

GGCTCCAGGAAGTTGCGGAGGCTGAAGGAATAGAGATAGACAGGGAAGCT 600 * 

CTGAGCTTCATCGCAAAAAGAGCCTCTGGAGGCTTGAGAGACGCGCTCAC 

CATGCTCGAGCAGGTGTGGAAGTTCTCGGAAGGAAAGATAGATCTCGAGA 70 0 

CGGTACACAGGGCGCTCGGGTTGATACCGATACAGGTTGTTCGCGATTAC 

GTGAACGCTATCTTTTCTGGTGATGTGAAAAGGGTCTTCACCGTTCTCGA 8 00 

CGACGTCTATTACAGCGGGAAGGACTACGAGGTGCTCATTCAGGAAGCAG 

TCGAGGATCTGGTCGAAGACCTGGAAAGGGAGAGAGGGGTTTACCAGGTT 900 

TCAGCGAACGATATAGTTCAGGTTTCGAGACAACTTCTGAATCTTCTGAG 

AGAGATAAAGTTCGCCGAAGAAAAACGACTCGTCTGTAAAGTGGGTTCGG 1000 

CTTACATAGCGACGAGGTTCTCCACCACAAACGTTCAGGAAAACGATGTC 

AG AG AAAAAAAC G AT AAT T C AAAT G T AC AG C AG AAAG AAG AG AAG AAAG A 1100 

AACGGTGAAGGCAAAAGAAGAAAAACAGGAAGACAGCGAGTTCGAGAAAC 

GCTTCAAAGAACTCATGGAAGAACTGAAAGAAAAGGGCGATCTCTCTATC 1200 

TTTGTCGCTCTCAGCCTCTCAGAGGTGCAGTTTGACGGAGAAAAGGTGAT 

TATTTCTTTTGATTCATCGAAAGCTATGCATTACGAGTTGATGAAGAAAA 1300 

AACTGCCTGAGCTGGAAAACATTTTTTCTAGAAAACTCGGGAAAAAAGTA 

G AAG T T G AAC T T C G AC T GAT G G G AAAAG AAG AAAC AAT C GAG AAG G T T T C 14 00 

T C AG AAG AT C C T GAG AT T G T T T G AAC AG G AG GG A 
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ME VL YRKYRPKT FS E VVNQDH VKKAI I GAIQKN SVAHGY I FAG PRGTGKT 

TLARILAKSLNCENRKGVEPCNSCRACREIDEGTFMDVIELDAASNRGID 100 
EIRRIRDAVGYRPMEGKYKVYI IDEVHMLTKEAFNALLKTLEEPPSHWF 

VLATTNLEKVPPTI ISRCQVFEFRNIPDELIEKRLQEVAEAEGIEIDREA 2 00 
LSFIAKRASGGLRDALTMLEQVWKFSEGKIDLETVHRALGLIPIQWRDY 

VNAIFSGDVKRVFTVLDDVYYSGKDYEVLIQEAVEDLVEDLERERGVYQV 300 
SANDIVQVSRQLLNLLREIKFAEEKRLVCKVGSAYIATRFSTTNVQENDV 

REKNDNSNVQQKEEKKETVKAKEEKQEDSEFEKRFKELMEELKEKGDLSI 4 00 
FVALSLSEVQFDGEKVIISFDSSKAMHYELMKKKLPELENIFSRKLGKKV 

EVELRLMGKEETIEKVSQKILRLFEQEG 47 8 
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ATGAAAGTAACCGTCACGACTCTTGAATTGAAAGACAAAATAACCATCGC • 

CTCAAAAGCGCTCGCAAAGAAATCCGTGAAACCCATTCTTGCTGGATTTC /I 00 
TTTTCGAAGTGAAAGATGGAAATTTCTACATCTGCGCGACCGATGTCGAG . ** ""'*' 

ACCGGAGTCAAAGCAACCGTGAATGCCGCTGAAATCTCCGGTGAGGCACG 200 
TTTTGTGGTACCAGGAGATGTCATTCAGAAGATGGTCAAGGTTCTCCCAG 

ATGAGATAACGGAACTTTCTTTAGAGGGGGATGCTCTTGTTATAAGTTCT 300 
GGAAGCACCGTTTTCAGGATCACCACCATGCCGGCGGACGAATTTCCAGA 

GATAACGCCTGCCGAGTCTGGAATAACCTTCGAAGTTGACAGTTCGCTCC 4 00 
TCGAGGAAATGGTTGAAAAGGTCATCTTCGCCGCTGCCAAAGACGAGTTC 

ATGCGAAATCTGAATGGAGTTTTCTGGGAACTCCACAAGAATCTTCTCAG 500- 
GCTGGTTGCAAGTGATGGTTTCAGACTTGCACTTGCTGAAGAGCAGATAG 

AAAACGAGGAAGAGGCGAGTTTCTTGCTCTCTTTGAAGAGGATGAAAGAA 600 
GTTCAAAACGTGCTGGACAACACAACGGAGCCGACTATAACGGTGAGGTA 

CGATGGAAGAAGGGTTTCTCTGTCGACAAATGATGTAGAAACGGTGATGA 7 00 
GAGTGGTCGACGCTGAATTTCCCGATTACAAAAGGGTGATCCCCGAAACT 

TTCAAAACGAAAGTGGTGGTTTCCAGAAAAGAACTCAGGGAATCTTTGAA 8 00 
GAGGGTGATGGTGATTGCCAGCAAGGGAAGCGAGTCCGTGAAGTTCGAAA 

TAGAAGAAAACGTTATGAGACTTGTGAGCAAGAGCCCGGATTATGGAGAA 900 
GTGGTCGATGAAGTTGAAGTTCAAAAAGAAGGGGAAGATCTCGTGATCGC 

TTTCAACCCGAAGTTCATCGAGGACGTTTTGAAGCACATTGAGACTGAAG 1000 
AAATCGAAATGAACTTCGTTGATTCTACCAGTCCATGTCAGATAAATCCA 

CTCGATATTTCTGGATACCTTTACATAGTGATGCCCATCAGACTGGCA 1098 
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MKVTVTTLELKDKITIASKALAKKSVKPILAGFLFEVKDGNFYICATDLE 
TGVKATVNAAEISGEARFWPGDVIQKMVKVLPDEITELSLEGDALVISS 100 
GSTVFRITTMPADEFPEITPAESGITFEVDTSLLEEMVEKVIFAAAKDEF 
MRNLNGVFWELHKNLLRLVASDGFRLALAEEQIENEEEASFLLSLKSMKE 200 
VQNVLDNTTEPTITVRYDGRRVSLSTNDVETVMRWDAEFPDYKRVIPET 
FKTKWVSRKELRESLKRVMVIASKGSESVKFEIEENVMRLVSKSPDYGE 300 
WDEVEVQKEGEDLVIAFNPKFIEDVLKHIETEEIEMN FVDSTSPCQINP 
LDISGYLYIVMPIRLA 366 
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ATGCCAGTCACGTTTCTCACAGGTACTGCAGAAACTCAGAAGGAAGAATT 
GATAAAGAAACTCCTGAAGGATGGTAACGTGGAGTACATAAGGATCCATC 
CGGAGGATCCCGACAAGATCGATTTCATAAGGTCTTTACTCAGGACAAAG 
AC GAT C T T T T C C AAC AAG AC GAT CAT T G AC AT C G T C AAT TT C.G AT G AGT G 
GAAAGCACAGGAGCAGAAGCGTCTCGTTGAACTTTTGAAAAACGTACCGG 
AAGACGTTCATATCTTCATCCGTTCTCAAAAAACAGGTGGAAAGGGAGTA 
GCGCTGGAGCTTCCGAAGCCATGGGAAACGGACAAGTGGCTTGAGTGGAT 
AGAAAAGCGCTTCAGGGAGAATGGTTTGCTCATCGATAAAGATGCCCTTC 
AGCTGTTTTTCTCCAAGGTTGGAACGAACGACCTGATCATAGAAAGGGAG 
ATTGAAAAACTGAAAGCTTATTCCGAGGACAGAAAGATAACGGTAGAAGA 
CGTGGAAGAGGTCGTTTTTACCTATCAGACTCCGGGATACGATGATTTTT 
GCTTTGCTGTTTCCGAAGGAAAAAGGAAGCTCGCTCACTCTCTTCTGTCG 
CAGCTGTGGAAAACCACAGAGTCCGTGGTGATTGCCACTGTCCTTGCGAA 
TCACTTCTTGGATCTCTTCAAAATCCTCGTTCTTGTGACAAAGAAAAGAT 
ACTACACCTGGCCTGATGTGTCCAGGGTGTCCAAAGAGCTGGGAATTCCC 
GTTCCTCGTGTGGCTCGTTTCCTCGGTTTCTCCTTTAAGACCTGGAAATT 
C AAGG T GAT G AAC C AC C T C C T C T AC T AC GAT G T G AAG AAG GT T AG AAAG A 
TACTGAGGGATCTCTACGATCTGGACAGAGCCGTGAAAAGCGAAGAAGAT 
CCAAAACCGTTCTTCCACGAGTTCATAGAAGAGGTGGCACTGGATGTATA 
T T C T C T T C AG AG AG AT G AAG AA 
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MPVTFLTGTAETQKEELIKKLLKDGNVEYIRIHPEDPDKIDFIRSLLRTK. 
TIFSNKTIIDIVNFDEWKAQEQKRLVELLKNVPEDVHIFIRSQKTGGKGV 
ALELPKPWETDKWLEWIEKRFRENGLLIDKDALQLFFSKVGTNDLIIERE 
IEKLKAYSEDRKITVEDVEEWFTYQTPGYDDFCFAVSEGKRKLAHSLLS 
QLWKTTESWIATVLANHFLDLFKILVLVTKKRYYTWPDVSRVSKELGIP 
VPRVARFLGFSFKTWKFKVMNHLLYYDVKKVRKILRDLYDLDRAVKSEED 
PKPFFHEFIEEVALDVYSLQRDEE 
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AT G AAC GAT T T GAT C AG AAA G T AC G C T AAAG AT C AAC T GG AAAC T T T G AA 

AAGGATCATAGAAAAGTCTGAAGGAATATCCATCCTCATAAATGGAGAAG 100' 

ATCTCTCGTATCCGAGAGAAGTATCCCTTGAACTTCCCGAGTACGTGGAG ' 

AAATTTCCCCCGAAGGCCTCGGATGTTCTGGAGATAGATCCCGAGGGGGA. .200 

GAACATAGGCATAGACGACATCAGAACGATAAAGGACTTCCTGAACTACA 

GCCCCGAGCTCTACACGAGAAAGTACGTGATAGTCCACGACTGTGAAAGA 300 

ATGACCCAGCAGGCGGCGAACGCGTTTCTGAAGGCCCTTGAAGAACCACC 

AGAATACGCTGTGATCGTTCTGAACACTCGCCGCTGGCATTATCTACTGC 4 00 

CGACGATAAAGAGCCGAGTGTTCAGAGTGGTTGTGAACGTTCCAAAGGAG 

T T C AG AG AT C T C GT G AAAG AG AAAAT AG GAG AT C T C T GGG AGG AAC T T C C 5.00 

ACTTCTTGAGAGAGACTTCAAAACGGCTCTCGAAGCCTACAAACTTGGTG 

CGGAAAAACTTTCTGGATTGATGGAAAGTCTCAAAGTTTTGGAGACGGAA 600 

AAACTCTTGAAAAAGGTCCTTTCAAAAGGCCTCGAAGGTTATCTCGCATG 

TAGGGAGCTCCTGGAGAGATTTTCAAAGGTGGAATCGAAGGAATTCTTTG 700 

CGCTTTTTGATCAGGTGACTAACACGATAACAGGAAAAGACGCGTTTCTT 

TTGATCCAGAGACTGACAAGAATCATTCTCCACGAAAACACATGGGAAAG 800 

CGTTGAAGATCAAAAAAGCGTGTCTTTCCTCGATTCAATTCTCAGGGTGA 

AGATAGCGAATCTGAACAACAAACTCACTCTGATGAACATCCTCGCGATA 900 

CACAGAGAGAGAAAGAGAGGTGTCAACGCTTGGAGC 
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MNDLIRKYAKDQLETLKRIIEKSEGISILINGEDLSYPREYSLELPEYVE 
KFPPKASDVLEIDPEGENIGIDDIRTIKDFLNYSPELYTRKYVIVHDCER 100 
MTQQAANAFLKALEEPPEYAVIVLNTRRWHYLLPTIKSRVFRWVNVPKE 
FRDLVKEKIGDLWEELPLLERDFKTALEAYKLGAEKLSGLMESLKVLETE 2 00 
KLLKKVLSKGLEGYLACRELLERFSKVESKEFFALFDQVTNTITGKDAFL 
LIQRLTRI ILHENTWESVEDKSVSFLDSILRVKIANLNNKLTLMNILAIH 300 
RERKRGVNAWS 
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ATGTCTTTCTT C AAC AAG AT CAT AC T CAT AG G AAG AC T C G T GAG AG AT C C 
CGAAGAGAGATACACGCTCAGCGGAACTCCAGTCACCACCTTCACCATAG 100 
CGGTGGACAGGGTTCCCAGAAAGAACGCGCCGGACGACGCTCAAACGACT 
GATTTCTTCAGGATCGTCACCTTTGGAAGACTGGCAGAGTTCGCTAGAAC . 2 00 
CTATCTCACCAAAGGAAGGCTCGTTCTCGTCGAAGGTGAAATGAGAATGA 
GAAGATGGGAAACACCCACTGGAGAAAAGAGGGTATCTCCGGAGGTTGTC 300 
GCAAACGTTGTTAGATTCATGGACAGAAAACCTGCTGAAACAGTTAGCGA 
GACTGAAGAGGAGCTGGAAATACCGGAAGAAGACTTTTCCAGCGATACCT 4 00 
T C AG T G AAG AT G AAC C AC CAT T T : ; 
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MSFFNKIILIGRLVRDPEERYTLSGTPVTTFTIAVDRVPRKNAPDDAQTT 
DFFRIVTFGRLAEFARTYLTKGRLVLVEGEMRMRRWETPTGEKRVSPEW 100 
ANVVRFMDRKPAETVSETEEELEIPEEDFSSDTFSEDEPPF 
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atgcgtgttcccccgcacaacttagaggccgaagttgctgtgctcggaag 

catattgatagatccgtcggtaataaacgacgttcttgaaattttgagcc 100 
ac g aag at t t c tat c t g aaaaaac ac c aac ac at c t t c ag ag c gat g g aa 

gagctttacgacgaaggaaaaccggtggacgtggtttccgtctgtgacaa 200 
gcttcaaagcatgggaaaactcgaggaagtaggtggagatgtggaagtgg 

cccagctcgctgaggctgtgcccagttctgcacacgcacttcactacgcg 300 
gagatcgtcaaggaaaaatccattctgaggaaactcattgagatctccag 

aaaaatctcagaaagtgcctacatggaagaagatgtggagatcctgctcg 4 00 
ac aac g c ag aaaag at gat c t t c gag at c t c ag ag at g aaaac g ac aaaa 

tcctacgatcatctgagaggcatcatgcaccgggtgtttgaaaacctgga 500 
gaacttcagggaaagagccaaccttatagaacccggtgtgctcataacgg 

gactaccaacgggattcaaaagtctggacaaacagaccacagggttccac 600 
agctccgatctggtgataatagcagcgagaccctccatgggaaaaacctc^ 

cttcgcactctcaatagcgaggaacatggctgtcaatttcgaaatccccg" 7 0 0 
tcggaatattcagtctcgagatgtccaaggaacagctcgctcaaagacta 

ctcagcatggagtccggtgtggatctttacagcatcagaacaggatacct 800 
ggatcaggagaagtgggaaagactcacaatagcggcttctaaactctaca 

aagcacccatagttgtggacgatgagtcactcctcgatccgcgatcgttg 900 
agggcaaaagcgagaaggatgaaaaaagaatacgatgtaaaagccatttt 

tgtcgactatctccagctcatgcacctgaaaggaagaaaagaaagcagac 10 00 
agcaggagatatccgagatctcgagatctctgaagctccttgcgagggaa 

ctcgacatagtggtgatagcgctttcacagctttcgagggccgtagaaca 1100 
gagagaagacaaaagaccgaggctgagtgacctcagggaatccggtgcga 

tagaacaggacgcagacacagtcatcttcatctacagggaggaatattac 12 00 
aggagcaaaaaatccaaagaggaaagcaagcttcacgaacctcacgaagc 

tgaaatcataataggtaaacagagaaacggtcccgttggaacgatcactc 13 00 
tgatcttcgaccccagaacggttacgttccatgaagtcgatgtggtgcat 

TCA ' ' 1353 
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MRVPPHNLEAEVAVLGSILIDPSVINDVLEILSHEDFYLKKHQHIFRAME 
ELYDEGKPVDVVSVCDKLQSMGKLEEVGGDLEVAQLAEAVPSSAHALHYA 10 0 
EIVKEKSILRKLIEISRKISESAYMEEDVEILLDNAEKMIFEISEMKTTK 
SYDHLRGIMHRVFENLENFRERANLIEPGVLITGLPTGFKSLDKQTTGFH 200 
S S DLVI I AARPSMGKT S FALS I ARNMAVN FE I PVG I FSLEMSKEQLAQRL 
LSMESGVDLYSIRTGYLDQEKWERLTIAASKLYKAPIWDDESLLDPRSL 300 
RAKARRMKKEYDVKAIFVDYLQLMHLKGRKESRQQEISEISRSLKLLARE 
LDIVVIALSQLSRAVEQREDKRPRLSDLRESGAIEQDADTVIFIYREEYY 4 00 
RSKKSKEESKLHEPHEAEII IGKQRNGPVGTITLIFDPRTVTFHEVDWH 
S 451 
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GTGATTCCTCGAGAGGTCATCGAGGAAATAAAAGAAAAGGTTGACATCGT 

AGAGGTCATTTCCGAGTACGTGAATCTTACCCGGGTAGGTTCCTCCTACA.... . 10,0 

GGGCTCTCTGTCCCTTTCATTCAGAAACCAATCCTTCTTTCTACGTTCAT 

CCGGGTTTGAAGATATACCATTGTTTCGGCTGCGGTGCGAGTGGAGACGT ■ • 200 

CATCAAATTTCTTCAAGAAATGGAAGGGATCAGTTTCCAGGAAGCGCTGG 

AAAGACTTGCCAAAAGAGCTGGGATTGATCTTTCTCTCTACAGAACAGAA 300 

GGGACTTCTGAATACGGAAAATACATTCGTTTGTACGAAGAAACGTGGAA 

AAGGTACGTCAAAGAGCTGGAGAAATCGAAAGAGGCAAAAGACTATTTAA 400 

AAAGCAGAGGCTTCTCTGAAGAAGATATAGCAAAGTTCGGCTTTGGGTAC 

GTCCCCAAGAGATCCAGCATCTCTATAGAAGTTGCAGAAGGCATGAACAT 500 

AACACTGGAAGAACTTGTCAGATACGGTATCGCGCTGAAAAAGGGTGATC 

GATTCGTTGATAGATTCGAAGGAAGAATCGTTGTTCCAATAAAGAACGAC 600 

AGTGGTCATATTGTGGCTTTTGGTGGGCGTGCTCTCGGCAACGAAGAACC 

GAAGTATTTGAACTCTCCAGAGACCAGGTATTTTTCGAAGAAGAAGACCC 7 00 

TTTTTCTCTTCGATGAGGCGAAAAAAGTGGCAAAAGAGGTTGGTTTTTTC 

GTCATCACCGAAGGCTACTTCGACGCGCTCGCATTCAGAAAGGATGGAAT 800 

ACCAACGGCGGTCGCTGTTCTTGGGGCGAGTCTTTCAAGAGAGGCGATTC 

TAAAACTTTCGGCGTATTCGAAAAACGTCATACTGTGTTTCGATAATGAC 900 

AAAGCAGGCTTCAGAGCCACTCTCAAATCCCTCGAGGATCTCCTAGACTA 

CGAATTCAACGTGCTTGTGGCAACCCCCTCTCCTTACAAAGACCCAGATG 1000 

AACTCTTTCAGAAAGAAGGAGAAGGTTCATTGAAAAAGATGCTGAAAAAC 

TCGCGTTCGTTCGAATATTTTCTGGTGACGGCTGGTGAGGTCTTCTTTGA 1100. 

CAGGAACAGCCCCGCGGGTGTGAGATCCTACCTTTCTTTCCTCAAAGGTT 

GGGTCCAAAAGATGAGAAGGAAAGGATATTTGAAACACATAGAAAATCTC .12 00 

G T G AAT G AG GT T T CAT CTTCTCTC C AG AT AC C AG AAAAC C AG AT T T T G AA 

CTTTTTTGAAAGCGACAGGTCTAACACTATGCCTGTTCATGAGACCAAGT 1 3 00 

CGTCAAAGGTTTACGATGAGGGGAGAGGACTGGCTTATTTGTTTTTGAAC 

TACGAGGATTTGAGGGAAAAGATTCTGGAACTGGACTTAGAGGTACTGGA 14 00 

AGATAAAAACGCGAGGGAGTTTTTCAAGAGAGTCTCACTGGGAGAAGATT 

T G AAC AAAG T CAT AG AAAAC T T C C C AAAAG AG C T G AAA G AC T G GAT T T T T 1500 

GAGACAATAGAAAGCATTCCTCCTCCAAAGGATCCCGAGAAATTCCTCGG 

TGACCTCTCCGAAAAGTTGAAAATCCGACGGATAGAGAGACGTATCGCAG 1600 

AAATAGATGATATGATAAAGAAAGCTTCAAACGATGAAGAAAGGCGTCTT 

CTTCTC.TCTATGAAAGTGGATCTCCTCAGAAAAATAAAGAGGAGG 1695 
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MIPREVIEEIKEKVDIVEVISEYVNLTRVGSSYRALCPFHSETNPSFYVH 

PGLKIYHCFGCGASGDVIKFLQEMEGISFQEALERLAKRAGIDLSLYRTEv . 100. 

GTSEYGKYIRLYEETWKRYVKELEKSKEAKDYLKSRGFSEEDIAKFGFGY 

VPKRSSISIEVAEGMNITLEELVRYGIALKKGDRFVDRFEGRIWPIKND 200 

SGHIVAFGGRALGNEEPKYLNSPETRYFSKKKTLFLFDEAKKVAKEVGFF 

VITEGYFDALAFRKDGIPTAVAVLGASLSREAILKLSAYSKNVILCFDND 300 

KAGFRATLKSLEDLLDYEFNVLVATPSPYKDPDELFQKEGEGSLKKMLKN 

SRS FE YFLVTAGEVFFDRNS PAGVRS YLS FLKGWVQKMRRKGYLKHI ENL 4 00 

VNEVSSSLQIPENQILNFFESDRSNTMPVHETKSSKVYDEGRGLAYLFLN 

YEDLREKILELDLEVLEDKNAREFFKRVSLGEDLNKVIENFPKELKDWIF 500 

ETIESIPPPKDPEKFLGDLSEKLKIRRIERRIAEIDDMIKKAiSNDEERRL 

LLSMKVDLLRKIKRR 565 
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ATGGCTCTACACCCGGCTCACCCTGGGGCAATAATCGGGCACGAGGCCGT 
TCTCGCCCTCCTTCCCCGCCTCACCGCCCAGACCCTGCTCTTCTCCGGCC 100 
CCGAGGGGGTGGGGCGGCGCACCGTGGCCCGCTGGTACGCCTGGGGGCTC 
AACCGCGGCTTCCCCCCGCCCTCCCTGGGGGAGCACCCGGACGTCCTCGA 200 
GGTGGGGCCCAAGGCCCGGGACCTCCGGGGCCGGGCCGAGGTGCGGCTGG 
AGGAGGTGGCGCCCCTCTTGGAGTGGTGCTCCAGCCACCCCCGGGAGCGG 300 
GTGAAGGTGGCCATCCTGGACTCGGCCCACCTCCTCACCGAGGCCGCCGC 
CT^CGCCCTCCTCAAGCTCCTGGAGGAGCCCCCTTCCTACGCCCGCATCG 4 00 
TCCTCATCGCCCCAAGCCGCGCCACCCTCCTCCCCACCCTGGCCTCCCGG 
GCCACGGAGGTGGCATTCGCCCCCGTGCCCGAGGAGGCCCTGCGCGCCCT 500 
CACCCAGGACCCGGAGCTCCTCCGCTACGCCGCCGGGGCCCCGGGCCGCC 
TCCTTAGGGCCCTCCAGGACCCGGAGGGGTACCGGGCCCGCATGGCCAGG 600 
GCGCAi\AGGGTCCTGAAAGCCCCGCCCCTGGAGCGCCTCGCTTTGCTTCG 
GGAGCTTTTGGCCGAGGAGGAGGGGGTCCACGCCCTCCACGCCGTCCTAA 700 
AGCGCCCGGAGCACCTCCTTGCCCTGGAGCGGGCGCGGGAGGCCCTGGAG 
GGGTACGTGAGCCCCGAGCTGGTCCTCGCCCGGCTGGCCTTAGACTTAGA 800 
GACA 
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MALHPAHPGAIIGHEAVLALLPRLTAQTLLFSGPEGVGRRTVARWYAWGL 
NRGFPPPSLGEHPDVLEVGPKARDLRGRAEVRLEEVAPLLEWCSSHPRER 100 
VKVAILDSAHLLTEAAANALLKLLEEPPSYARIVLTAPSRATLLPTLASR 
ATEVAFAPVPEEALRALTQDPELLRYAAGAPGRLLRALQDPEGYRARMAR 200 
AQRVLKAPPLERLALLRELLAEEEGVHALHAVLKRPEHLLALERAREALE 
GYVSPELVLARLALDLET 2 68 
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ATGCTGGACCTGAGGGAGGTGGGGGAGGCGGAGTGGAAGGCCCTAAAGCC 
CCTTTTGGAAAGCGTGCCCGAGGGCGTCCCCGTCCTCCTCCTGGACCCTA -100 
AGCCAAGCCCCTCCCGGGCGGCCTTCTACCGGAACCGGGAAAGGCGGGAG 
TTCCCCACCCCCAAGGGGAAGGACCTGGTGCGGCACCTGGAAAACCGGGG 20 0 
CAAGCGCCTGGGGCTCAGGCTCCCGGGCGGGGTGGCCCAGTACCTGGCCT 
CCCTGGAGGGGGACCTCGAGGCCCTGGAGCGGGAGCTGGAGAAGCTTGCC 300 
CTCCTCTCCCCACCCCTCACCCTGGAGAAGGTGGAGAAGGTGGTGGCCCT 
GAGGCCCCCCCTCACGGGCTTTGACCTGGTGCGCTCCGTjCCTGGAGAAGG 4 00 
ACCCCAAGGAGGCCCTCCTGCGCCTAGGCGGCCTCAAGGAGGAGGGGGAG 
GAGCCCCTCAGGCTCCTCGGGGCCCTCTCCTGGCAGTTCGCCCTCCTCGC 500 
CCGGGCCTTCTTCCTCCTCCGGGAAAACCCCAGGCCCAAGGAGGAGGACC 
TCGCCCGCCTCGAGGCCCACCCCTACGCCGCCCGCCGCGCCCTGGAGGCG 600 
GCGAAGCGCCTCACGGAAGAGGCCCTCAAGGAGGCCCTGGACGCCCTCAT 
GGAGGCGGAAAAGAGGGCCAAGGGGGGGAAAGACCCGTGGCTCGCCCTGG 7 0 0 r 
AGGCGGCGGTCCTCCGCCTCGCCCGTTGA 
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MVIAFTGDPFLAREALLEEARLRGLSRFTEPTPEALAQALAPGLFGGGGA 

MLDLREVGEAEWKALKPLLESVPEGVPVLLLDPKPSPSRAAFYRNRERRD 100 
FPTPKGKDLVRHLENRAKRLGLRLPGGVAQYLASLEGDLEALERELEKLA 

LLSPPLTLEKVEKWALRPPLTGFDLVRSVLEKDPKEALLRLGGLKEEGE 200 
EPLRLLGALSWQFALLARAFFLLRENPRPKEEDLARLEAHPYAARRALEA 

AKRLTEEALKEALDALMEAEKRAKGGKDPWLALEAAVLRLAR 2 92' 



FIG. 75 



ATGGCTCGAGGCCTGAACCGCGTTTTCCTCATCGGCGCCCTCGCCACCCG 
GCCGGACATGCGCTACACCCCGGCGGGGCTCGCCATTTTGGACCTGACCC 
TCGCCGGTCAGGACCTGCTTCTTTCCGATAACGGGGGGGAACCGGAGGTG 
TGCTGGTACCACCGGGTGAGGCTCTTAGGCCGCCAGGCGGAGATGTGGGG 
CGACCTCTTGGACCAAGGGCAGCTCGTCTTCGTGGAGGGCCGCCTGGAGX 
ACCGCCAGTGGGAAAGGGAGGGGGAGAAGCGGAGCGAGCTCCAGATCCGG 
GCCGACTTCCGGACCC'CCTGGACGACCGGGGGAAGAAGCGGGCGGAGGAC 
AGCCGGGGCCAGCCCAGGCTCCGCGCCGCCCTGAACCAGGTCTTCCTCAT 
GGGCAACCTGACCCGGGACCCGGAACTCCGCTACACCCCCCAGGGCACCG 
CGGTGGCCCGGCTGGGCCTGGCGGTGAACGAGCGCCGCCAGGGGG.CGGAG 
GAGCGCACCCACTTCGTGGAGGTTCAGGCCTGGCGCGACCTGGCGGAGTG 
GGCCGCCGAGCTGAGGAAGGGCGACGGCCTTTTCGTGATCGGCAGGTTGG 
TGAACGACTCCTGGACCAGCTCCAGCGGCGAGCGGCGCTTCCAGACCCGT 
GTGGAGGCCCTCAGGCTGGAGCGCCCCACCCGTGGACCTGCCCAGGCCTG 
CCCAGGCCGGCGGAACAGGTCCCGCGAAGTCCAGACGGGTGGGGTGGACA 
TTGACGAAGGCTTGGAAGACTTTCCGCCGGAGGAGGATTTGCCGTTTTGA 
GCACGAA 
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MARGLNRVFLIGALATRPDMRYTPAGLAILDLTLAGQDLLLSDNGGEPEV 
SWYHRVRLLGRQAEMWGDLLDQGQLVFVEGRLEYRQWEREGEKRSELQIR 
ADFLDPLDDRGKKRAEDSRGQPRLRAALNQVFLMGNLTRDPELRYTPQGT 
AVARLGLAVNERRQGAEERTHFVEVQAWRDLAEWAAELRKGDGLFVIGRL 
VNDSWTSSSGERRFQTRVEALRLERPTRGPAQACPGRRNRSREVQTGGVD 
IDEGLEDFPPEEDLPF 
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AATTCCGACATTTCAATTGAATCGTTTATTCCGCTTGAAAAAGAAGGCAA 

GTTGCTCGTTGATGTGAAAAGACCGGGGAGCATCGTACTGCAGGCGCGPT.. 

TTTTCTCTGAAATCGTGAAAAAACTGCCGCAACAAACGGTGGAAATCGAA 

ACGGAAGACAACTTTTTGACGATCATCCGCTCGGGGCACTCAGAATTCCG 

CCTCAATGGGCTAAACGCCGACGAATATCCGCGCCTGCCGCAAATTGAAG 

AAGAAAACGTGTTTCAAATCCCGGCTGATTTATTGAAAACCGTGATTCGG 

CAAACGGTGTTCGCCGTTTCTACATCGGAAACGCGCCCAATCTTGACAGG 

TGTCAACTGGAAAGTTGAACATGGCGAGCTTGTCTGCACAGCGACCGACA 

GTCATCGCTTAGCCATGCGCAAAGTGAAAATTGAGTCGGAAAATGAAGTA 

TCATACAACGTCGTCATCCCTGGAAAAAGTCTTAATGAGCTCAGCAAAAT 

TTTGGATGACGGCAACCACCCGGTGGACATCGTCATGACAGCCAATCAAG 

TGCTATTTAAGGCCGAGCACCTTCTCTTCTTTTCCCGGCTGCTTGACGGC 

AACTATCCGGAGACGGCCCGCTTGATTCCAACAGAAAGCAAAACGACCAT 

GATCGTCAATGCAAAAGAGTTTCTGCAGGCAATCGACCGAGCGTCCTTGC 

TTGCTCGAGAAGGAAGGAACAACGTTGTGAAACTGACGACGCTTCCTGGA 

GGAATGCTCGAAATTTCTTCGATTTCTCCGAGATCGGGAAAGTGACGGAG 

CAGCTGCAAACGGAGTCTCTTGAAGGGGAAGAGTTGAACATTTCGTTCAG 

CGCGAAATATATGATGGACGCGTTGCGGGCGCTTGATGGT^ACAGACATTT 

CAAATCAGCTTCACTGGGGCCATGCGGCCGTTCCTGTTGCGCCCGCTTCA 

ACCGATTCGATGCTTCAGCTCATTTTGCCGGTGAGAACATAT 



FIG. 78 



NSDISIIESFIPLEKEGKLLVDVKRPGSIVLQARFFSEIVKKLPQQTVEI 
ETEDNFLTIIRSGHSEFRLNGLNADEYPRLPQIEEENVFQIPADLLKTVI 
RQTVFAVSTSETRPILTGVNWKVEHGELVCTATDSHRLAMRKVKIIESEN 
EVSYNVVIPGKSLNELSKI ILDDGNHPVDIVMTANQVLFKAEHLLFFSRL 
LDGNYPETARLIPTESKTTMIVNAKEFLQAIDRASLLAREGRNNWKLTT 
LPGGMLEISSISPEIGKVTEQLQTESLEGEELNISFSAKYMMDALRALDG 
TDIQISFTGAMRPFLLRPLHTDSMLQLILPVRTY 
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ATGATTAACCGCGTCATTTTGGTCGGCAGGTTAACGAGAGATCCGGAGTT 
GCGTTACACTCCAAGCGGAGTGGCTGTTGCCACGTTTACGCTCGCGGTCA' ,100 
ACCGTCCGTTTACAAATCAGCAGGGCGAGCGGGAAACGGATTTTATTCAA" = 
TGTGTCGTTTGGCGCCGCCAGGCGGAAAACGTCGCCAACTTT.TTGAAAAA 200 
GGGGAGCTTGGCTGGTGTCGATGGCCGACTGCAAACCCGCAGCTATGAAA 
ATCAAGAAGGTCGGCGTGTGTACGTGACGGAAGTGGTGGCTGATAGCGTC 300 
CAATTTCTTGAGCCGAAAGGAACGAGCGAGCAGCGAGGGGCGACAGCAGG 
CGGCTACTATGGGGATCCATTCCCATTCGGGCAAGATCAGAACCACCAAT 400 
ATCCGAACGAAAAAGGGTTTGGCCGCATCGATGACGATCCTTTCGCCAAT 
GACGGCCAGCCGATCGATATTTCTGATGATGATTTGCCGTTT 4 92 



FIG. 80 



MINRVILVGRLTRDPELRYTPSGVAVATFTLAVNRPFTNQSYENQEGRRV 
YVTEWADSVQFLEPKGTSEQRGATAGGYYQGERETDFIQCWWRRQAEN 100 
VANFLKKGSLAGVDGRLQTRGDPFPFGQDQNHQYPNEKGFGRIDDDPFAN 
DGQPIDISDDDLPF 164 



FIG. 81 



ATGCTGGAACGCGTATGGGGAAACATTGAAAAACGGCGTTTTTCTCCCCT 
TTATTTATTATACGGCAATGAGCCGTTTTTATTAACGGAAACGTATGAGC 100 
GATTGGTGAACGCAGCGCTTGGCCCCGAGGAGCGGGAGTGGAACTTGGCT 
■GTGTACGACTGCGAGGAAACGCCGATCGAGGCGGCGCTTGAGGAGGCCGA 200 
GACGGTGCCGTTTTTCGGCGAGCGGCGTGTCATTCTCATCAAGCATCCAT 
ATTTTTTTACGTCTGAAAAAGAGAAGGAGATCGAACATGATTTGGCGAAG 300 
CTGGAGGCGTACTTGAAGGCGCCGTCGCCGTTTTCGATCGTCGTCTTTTT 
CGCGCCGTACGAGAAGCTTGATGAGCGAAAAAAAATTACGAAGCTCGCCA 4 00 
AAGAGCAAAGCGAAGTCGTCATCGCCGCCCCGCTCGCCGAAGCGGAGCTG 
CGTGCCTGGGTGCGGCGCCGCATCGAGAGCCAAGGGGCGCAAGCAAGCGA 500 
CGAGGCGATTGATGTCCTGTTGCGGCGGGCCGGGACGCAGCTTTCCGGCT 
TGGCGAATGAAATCGATAAATTGGCCCTGTTTGCCGGATCGGGCGGAACC 600 
ATCGAGGCGGCGGCGGTTGAGCGGCTTGTCGCCCGCACGCCGGAAGAAAA 
CGTATTTGTGCTTGTCGAGCAAGTGGCGAAGCGCGACATTCCAGCAGCGT 700 
T G C AG AC G T T T TAT GAT C T G C T T G AAAAC AAT G AAG AG C C GAT C AAAAT T 
TTGGCGTTGCTCGCCGCCCATTTCCGCTTGCTTTCGCAAGTGAAATGGCT 8 00 
TGCCTCCTTAGGCTACGGACAGGCGCAAATTGCTGCGGCGCTCAAGGTGC 
ACCCGTTCCGCGTCAAGCTCGCTCTTGCTCAAGCGGCCCGCTTCGCTGAC 900 
GGAGAGCTTGCTGAGGCGATCAACGAGCTCGCTGACGCGGATTACGAAGT 
GAAAAGCGGGGCGGTCGATCGCCGGTTGGCCGTTGAGCTGCTTCTGATGC 1000 
GCTGGGGCGCCCGCCCGGCGCAAGCGGGGCGCCACGGCCGGCGG 



FIG. 82 



MLERVWGNIEKRRFSPLYLLYGNEPFLLTETYERLVNAALGPEEREWNLA 
VYDCEETPIEAALEEAETVPFFGERRVILIKHPYFFTSEKEKEIEHDLAK 100 
LEAYLKAPSPFSIVVFFAPYEKLDERKKITKLAKEQSEVVIAAPLAEAEL 
RAWVRRRIESQGAQASDEAI DVLLRRAGTQLSALANEI DKLALFAGSGGT 200 
IEAAAVERLVARTPEENVFVLVEQVAKRDIPAALQTFYDLLENNEEPIKI 
LALLAAHFRLLSQVKWLASLGYGQAQIAAALKVHPFRVKLALAQAARFAD 300 
GELAEAINELADADYEVKSGAVDRRLAVELLLMRWGARPAQAGRHGRR 



FIG. 83 



ATGCGATGGGAACAGCTAGCGAAACGCCAGCCGGTGGTGGCGAAAATGCT 
GCAAAGCGGCTTGGAAAAAGGGCGGATTTCTCATGCGTACTTGTTTGAGG 100 
GGCAGCGGGGGACGGGCAAAAAAGCGGCCAGTTTGTTGTTGGCGAMCGT ' 
TTGTTTTGTCTGTCCCCAATCGGAGTTTCCCCGTGTCTAGAGTGCCG.CAA - 200 
CTGCCGGCGCATCGACTCCGGCAACCACCCTGACGTCCGGGTGATCGGCG 
CAGATGGAGGATCAATCAAAAAGGAACAAATCGAATGGCTGCAGCAAGAG 300 
TTCTCGAAAACAGCGGTCGAGTCGGATAAAAAAATGTACATCGTTGAGCA 
CGCCGATCAAATGACGACAAGCGCTGCCAACAGCCTTCTGAAATTTTTGG 4 00 
AAGAGCCGCATCCGGGGACGGTGGCGGTATTGCTGACTGAGCAATACCAC 
CGCCTGCTAGGGACGATCGTTTCCCGCTGTCAAGTGCTTTCGTTCCGGCC 500 
GTTGCCGCCGGCAGAGCTCGCCCAGGGACTTGTCGAGGAGCACGTGCCGT 
TGCCGTTGGCGCTGTTGGCTGCCCATTTGACAAAGAGCTTCGAGGAAGCA . -60.0 
CTGGCGCTTGCCAAAGATAGTTGGTTTGCCGAGGCGCGAACATTAGTGCT 
ACAATGGTATGAGATGCTGGGCAAGCCGGAGCTGCAGCTTTTGTTTTTCA . 7 00 
TCCACGACCGCTTGTTTCCGCATTTTTTGGAAAGCCATCAGCTTGACCTT 
GGAC.TTG 757 



FIG. 84 



MRWEQLAKRQPWAKMLQSGLEKGRIS-HAYLFEGQRGTGKKAASLLLAKR 

LFCLSPIGVSPCLECRNCRRIDSGNHPDVRVIGPDGGSIKKEQIEWLQQE 100 

FSKTAVESDKKMYIVEHADQMTTSAANSLLKFLEEPHPGTVAVLLTEQYH 

RLLGTIVSRCQVLSFRPLPPAELAQGLVEEHVPLPLALLAAHLTNSFEEA 200 

LALAKDSWFAEARTLVLQWYEMLGKPELQLLFFIHDRLFPHFLESHQLDL 

GL 252 



FIG. 85 



GTGGCATACCAAGCGTTATATCGCGTGTTTCGGCCGCAGCGCTTTGCGGA 

CATGGTCGGCCAAGAACACGTGACCAAGACGTTGCAAAGCGCCCTGCTTC . 100 

AACATAAAATATCGCACGCTTACTTATTTTCCGGCCCGCGCGGTACAGGA 

AAAACGAGCGCAGCGAAAATTTTCGCCAAGGCGGTCAACTGTGAACAGGC 200 

GCCAGCGGCGGAGCCATGCAATGAGTGTCCAGCTTGCCTCGGCATTACGA 

ATGGAACGGTTCCCGATGTGCTGGAAATTGACGC.TGCTTCCAACAACCGC 300 

GTCGATGAAATTCGTGATATCCGTGAGAAGGTGAAATTTGCGCCAACGTC 

GGCCCGCTACAAAGTGTATATCATCGACGAGGTGCATATGCTGTCGATCG 4 00 

GTGCGTTTAACGCGCTGTTGAAAACGTTGGAGGAGCCGCCGAAACACGTC 

ATTTTCATTTTGGCCACGACCGAGCCGCACAAAATTCCGGCGACGATCAT 500 

TTCCCGCTGCCAACGGTTCGATTTTCGCCGCATCCCGCTTCAGGCGATCG ' ' 

TTTCACGGCTAAAGTACGTCGCAAGCGCCCAAGGTGTCGAGGCGTCAGAT 600 

GAGGCATTGTCCGCCATCGCCCGTGCTGCAGACGGGGGGATGCGCGATGC 

GCTCAGCTTGCTTGATCAAGCCATTTCGTTCAGCGACGGGAAACTTCGGC 7 00 

TCGACGACGTGCTGGCGATGACCGGGGCTGCATCATTTGCCGCCTTATCG 

AGCTTCATCGAAGCCATCCACCGCAAAGATACAGCGGCGGTTCTTCAGCA 8 00 

CTTGGAAACGATGATGGCGCAAGGGAAAGATCCGCATCGTTTGGTTGAAG 

ACTTGATTTTGTACTATCGCGATTTATTGCTGTACAAAACCGCTCCCTAT 900 

GTGGAGGGAGCGATTCAAATTGCTGTCGTTGACGAAGCGTTCACTTCACT 

GTCGGAAATGATTCCGGTTTCCAATTTATACGAGGCCATCGAGTTGCTGA 1000 

ACAAAAGCCAGCAAGAGATGAAGTGGACAAACCACCCGCGCCTTCTGTTG 

GAAGTGGCGCTTGTGAAACTTTGCCATCCATCAGCCGCCGCCCCGTCGCT 1100 

GTCGGCTTCCGAGTTGGAACCGTTGAT7y\AGCGGATTGAAACGCTGGAGG 

CGGAATTGCGGCGCCTGAAGGAACAACCGCCTGCCCCTCCGTCGACCGCC 1200 

GCGCCGGTGAAAAAACTGTCCAAACCGATGAAAACGGGGGGATATAAAGC 

CCCGGTTGGCCGCATTTACGAGCTGTTGAAACAGGCGACGCATGAAGATT 1300 

TAGCTTTGGTGAAAGGATGCTGGGCGGATGTGCTCGACACGTTGAAACGG 

CAGCATAAAGTGTCGCACGCTGCCTTGCTGCAAGAGAGCGAGCCGGTTGC 14 00 

AGCGAGCGCCTCAGCGTTTGTATTAAAATTCAAATACGAAATCCACTGCA 

AAATGGCGACCGATCCCACAAGTTCGGTCAAAGAAAACGTCGAAGCGATT 1500 

TTGTTTGAGCTGACAAACCGCCGCTTTGAAATGGTAGCCATTCCGGAGGG 

AGAATGGGGAAAAATAAGAGAAGAGTTCATCCGCAATAAGGACGCCATGG 1600 

TGGAAAAAAGCGAAGAAGATCCGTTAATCGCCGAAGCGAAGCGGCTGTTT 

G G C G AAG AG C T GAT C G AAAT T AAA G AA 167 7 
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VAYQALYRVFRPQRFADMVGQEHVTKTLQSALLQHKISHAYLFSGPRGTG 
KTSAAKIFAKAVNCEQAPAAEPCNECPACLGITNGTVPDVLEIDAASNNR 100 
VDEIRDIREKVKFAPTSARYKVYIIDEVHMLSIGAFNALLKTLEEPPKHV ' ■ 
I FILATTEPHKI PAT I I SRCQRFDFRRI PLQAI VSRLKYVASAQGVEASD 200 
EALSAIARAADGGMRDALSLLDQAISFSDGKLRLDDVLAMTGAASFAALS 
SFIEAIHRKDTAAVLQHLETMMAQGKDPHRLVEDLILYYRDLLLYKTAPY 300 
VEGAIQIAWDEAFTSLSEMIPVSNLYEAIELLNKSQQEMKWTNHPRLLL 
EVALVKLCHPSAAAPSLSASELEPLIKRIETLEAELRRLKEQPPAPPSTA 400 
APVKKLSKPMKTGGYKAPVGRIYELLKQATHEDLALVKGCWADVLDTLKR 
QHKVSHAALLQESEPVAASASAFVLKFKYEIHCKMATDPTSSVKENVEAI '500 
LFELTNRRFEMVAIPEGEWGKIREEFIRNKDAMVEKSEEDPLIAEAKRLF 
GEELIEIKE 559 
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ATGGTGACAAAAGAGCAAAAAGAGCGGTTTCTCATCCTGCTTGAGCAGCT 
GAAGATGACGTCGGACGAATGGATGCCGCATTTTCGTGAGGCAGCCATTC 100 
GCAAAGTCGTGATCGATAAAGAGGAGAAAAGCTGGCATTTTTATTTTCAG ■ 
TTCGACAACGTGCTGCCGGTTCATGTATACAAAACGTTTGCCGATCGGCT' - 200 
GCAGACGGCGTTCCGCCATATCGCCGCCGTCCGCCATACGATGGAGGTCG 
AAGCGCCGCGCGTAACTGAGGCGGATGTGCAGGCGTATTGGCCGCTTTGC 300 
CTTGCCGAGCTGCAAGAAGGCATGTCGCCGCTTGTCGATTGGCTCAGCCG 
GCAGACGCCTGAGCTGAAAGGAAACAAGCTGCTTGTCGTTGCCCGCCATG 400 
AAGCGGAAGCGCTGGCGATCAAACGGCGGTTCGCCAAAAAAATCGCTGAT 
GTGTACGCTTCGTTTGGGTTTCCCCCCCTTCAGCTTGACGTCAGCGTCGA 500 
GCCGTCCAAGCAAGAAATGGAACAGTTTTTGGCGCAAAAACAGCAAGAGG 
ACGAAGAGCGAGCGCTTGCTGTACTGACCGATTTAGCGAGGGAAGAAGAA 600 
AAGGCCGCGTCTGCGCCGCCGTCCGGTCCGCTTGTCATCGGCTATCCGAT 
CCGCGACGAGGAGCCGGTGCGGCGGCTTGAAACGATCGTCGAAGAAGAGC 700 
GGCGCGTCGTTGTGCAAGGCTATGTATTTGACGCCGAAGTGAGCGAATTA 
AAAAGCGGCCGCACGCTGTTGACCATGAAAATCACAGATTACACGAACTC 800 
GATTTTAGTCAAAATGTTCTCGCGCGACAAAGAGGACGCCGAGCTTATGA 
GCGGCGTCAAAAAAGGCATGTGGGTGAAAGTGCGCGGCAGCGTGCAAAAC 900 
GATACGTTCGTCCGTGATTTGGTCATCATCGCCAACGATTTGAACGAAAT 
CGCCGCAAACGAACGGCAAGATACGGCGCCGGAAGGGGAAAAGAGGGTCG 1000 
AGCTCCATTTGCATACCCCGATGAGCCAAATGGACGCGGTCACCTCGGTG 
ACAAAACTCATTGAGCAAGCGAAAAAATGGGGGCATCCGGCGATCGCCGT 1100 
CACCGACCATGCCGTTGTTCAGTCGTTTCCGGAGGCCTACAGCGCGGCGA 
AAAAACACGGCATGAAGGTCATTTACGGCCTTGAGGCGAACATCGTCGAC 1200, 
GATGGCGTGCCGATCGCCTACAATGAGACGCACCGCCGTCTTTCGGAGGA 
AACGTACGTCGTCTTTGACGTCGAGACGACGGGCCTGTCGGCTGTGTACA 1300 
ATACGATCATTGAGCTGGCGGCGGTGAAAGTGAAAGACGGCGAGATCATC 
GACCGATTCATGTCGTTTGCCAACCCTGGACATCCGTTGTCGGTGACAAC 1400 
GATGGAGCTGACTGGGATCACCGATGAGATGGTGAAAGACGCCCCGAAGC 
CGGACGAGGTGCTAGCCCGTTTTGTTGACTGGGCCGGCGATGCGACGCTT 1500 
GTTGCCCACAACGCCAGCTTTGACATCGGTTTTTTAAACGCGGGCCTCGC 
TCGCATGGGGCGCGGCAAAATCGCGAATCCAGTCATCGATACGCTCGAGC 1600 
TGGCCCGTTTTTTATACCCGGATTTGAAAAACCATCGGCTCAATACATTG 
TGCAAAAAATTTGACATTG7\ATTGACGCAGCATCACCGCGCCATCTACGA 1700 
CGCGGAGGCGACCGGGCATTTGCTTATGCGGCTGTTGAAGGAAGCGGAAG 
AGCGCGGCATACTGTTTCATGACGAATTAAACAGCCGCACGCACAGCGAA 1800 
GCGTCCTATCGGCTTGCGCGCCCGTTCCATGTGACGCTGTTGGCGCAAAA 
CGAGACTGGATTGAAAAATTTGTTCAAGCTTGTGTCATTGTCGCACATTC 1900 
AATATTTTCACCGTGTGCCGCGCATCCCGCGCTCCGTGCTCGTCAAGCAC 
CGCGACGGCCTGCTTGTCGGCTCGGGCTGCGACAAAGGAGAGCTGTTTGA 2 000 
CAACTTGATCCAAAAGGCGCCGGAAGAAGTCGAAGACATCGCCCGTTTTT 
ACGATTTTCTTGAAGTGCATCCGCCGGACGTGTACAAGCCGCTCATCGAG 2100 
ATGGATTATGTGAAAGACGAAGAGATGATCAAAAACATCATCCGCAGCAT 
CGTCGCCCTTGGTGAGAAGCTTGACATCCCGGTTGTCGCCACTGGCAACG 2200 
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TCGCAAGGCGGGGCGAATCCGCTCAACCGCCATGAACTGCCGGATGTATA .2300 
TTTCCGTACGACGAATGAAATGCTTGACTGCTTCTCGTTTTTAGGGCCGG 
AAAAAGCGAAGGAAATCGTCGTTGACAACACGCAAAAAATCGCTTCGTTA *24 00' ■ 
ATCGGCGATGTCAAGCCGATCAAAGATGAGCTGTATACGCCGCGCATTGA 
AGGGGCGGACGAGGAAATCAGGGAAATGAGCTACCGGCGGGCGAAGGAAA 2 500 
TTTACGGCGACCCGTTGCCGAAACTTGTTGAAGAGCGGCTTGAGAAGGAG 
CTAAAAAGCATCATCGGCCATGGCTTTGCCGTCATTTATTTGATCTCGCA 2 600 
CAAGCTTGTGAAAAAATCGCTCGATGACGGCTACCTTGTCGGGTCGCGCG 
GATCGGTCGGCTCGTCGTTTGTCGCGACGATGACGGAAATCACCGAGGTC 2'70 0 
AATCCGCTGCCGCCGCATTACGTTTGCCCGAACTGCAAGCATTCGGAGTT 
CTTTAACGACGGTTCAGTCGGCTCAGGGTTTGATTTGCCGGATAAAAACT 28 00 
GCCCGCGATGTGGGACGAAATACAAGAAAGACGGGCACGACATCCCGTTT 
GAGACGTTTCTCGGCTTTAAAGGCGACAAAGTGCCGGATATCGACTTGAA 2900 
CTTTTCCGGCGAATACCAGCCGCGCGCCCACAACTATACGAAAGTGCTGT 
TTGGCGAAGACAACGTCTACCGCGCCGGGACGATTGGCACGGTCGCTGAC 3000 
AAAACGGCGTACGGATTTGTCAAAGCGTATGCGAGCGACCATAACTTAGA 
GCTGCGCGGCGCGGAAATCGACGGCTCGCGGCTGGCTGCACCGGGGTGAA 3100 
GCGGACGACCGGGCAGCATCCGGGCGGCATCATCGTCGTCCCGGATTATA 
TGGAAATTTACGATTTTACGCCGATTCAATATCCGGCCGATGACACGTCC 3200 
TCTGAATGGCGGACGACCCATTTCGACTTCCATTCGATCCACGACAATTT 
GTTGAAGCTCGATATTCTCGGGCACGACGATCCGACGGTCATTCGCATGC 3300 
TGCAAGATTTAAGCGGCATCGATCCGAAAACGATCCCGACCGACGACCCG 
GATGTGATGGGCATTTTCAGCAGCACCGAGCCGCTTGGCGTTACGCCGGA 34 00 
GCAAATCATGTGCAATGTCGGCACGATCGGCATTCCGGAGTTTGGCACGC 
GCTTCGTTCGGCAAATGTTGGAAGAGACAAGGCCAAAAACGTTTTCCGAA 3500 
CTCGTGCAAATTTCCGGCTTGTCGCACGGCACCGATGTGTGGCTCGGCAA 
CGCGCAAGAGCTCATTCAAAACGGCACGTGTACGTTATCGGAAGTCATCG 3600 
GCTGCCGCGACGACATTATGGTCTATTTGATTTACCGCGGGCTCGAGCCG 
TCGCTCGCTTTTAAAATCATGGAATCCGTGCGCAAAGGAAAAGGCTTAAC 37 00 
GCCGGAGTTTGAAGCAGAAATGCGCAAACATGACGTGCCGGAGTGGTACA 
TCGATTCATGCAAAAAAATCAAGTACATGTTCCCGAAAGCGCACGCCGCC 38 00 
GCCTACGTGTTAATGGCGGTGCGCATCGCCTACTTTAAGGTGCACCATCC 
GCTTTTGTATTACGCGTCGTACTTTACGGTGCGGGCGGAGGACTTTGACC 3900 
TTGACGCCATGATCAAAGGATCACCCGCCATTCGCAAGCGGATTGAGGAA 
ATCAACGCCAAAGGCATTCAGGCGACGGCGAAAGAAAAAAGCTTGCTCAC 4 000 
GGTTCTTGAGGTGGCCTTAGAGATGTGCGAGCGCGGCTTTTCCTTTAAAA 
ATATCGATTTGTACCGCTCGCAGGCGACGGAATTCGTCATTGACGGCAAT 4100 
TCTCTCATTCCGCCGTTCAACGCCATTCCGGGGCTTGGGACGAACGTGGC 
GCAGGCGATCGTGCGCGCCCGCGAGGAAGGCGAGTTTTTGTCGAAGGAGG 4200 
ATTTGCAACAGCGCGGCAAATTGTCGAAAACGCTGCTCGAGTATCTAGAA 
AGCCGCGGCTGCCTTGACTCGCTTCCAGACCATAACCAGCTGTCGCTGTT 4300 
T 
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MVTKEQKERFLILLEQLKMTSDEWMPHFREAAIRKWIDKEEKSWHFYFQ 
FDNVLPVHVYKTFADRLQTAFRHIAAVRHTMEVEAPRVTEADVQAYWPLC 100 
LAELQEGMSPLVDWLSRQTPELKGNKLLVVARHEAEALAIKRRFAKKIAD " 
VYAS FG F P PLQL DVS VE P S KQEMEQ FLAQKQQE DEERALAVLT DLAREEE ' 200 
KAASAPPSGPLVIGYPIRDEEPVRRLETIVEEERRWVQGYVFDAEVSEL' 
KSGRTLLTMKI T D YTN S I LVKMFS RDKE DAELMSGVKKGMWVKVRGS VQN 300 
DTFVRDLVIIANDLNEIAANERQDTAPEGEKRVELHLHTPMSQMDAVTSV 
TKLIEQAKKWGHPAIAVTDHAVVQSFPEAYSAAKKHGMKVIYGLEANIVD 400 
DGVPIAYNETHRRLSEETYVVFDVETTGLSAVYNTIIELAAVKVKDGEII 
DRFMSFANPGHPLSVTTMELTGITDEMVKDAPKPDEVLARFVDWAGDATL 500 
VAHNASFDIGFLNAGLARMGRGKIANPVIDTLELARFLYPDLKNHRLNTL 
CKKFDIELTQHHRAIYDAEATGHLLMRLLKEAEERGILFHDELNSRTHSE 600 
ASYRLARPFHVTLLAQNETGLKNLFKLVSLSHIQYFHRVPRIPRSVLVKH 
RDGLLVGSGCDKGELFDNLIQKAPEEVEDIARFYDFLEVHPPDVYKPLIE 700 
MDYVKDEEMIKNI IRSIVALGEKLDIPWATGNVHYLNPEDKIYRKILIH 
SQGGANPLNRHELPDVYFRTTNEMLDCFSFLGPEKAKEIWDNTQKIASL 800 
IGDVKPIKDELYTPRIEGADEEIREMSYRRAKEIYGDPLPKLVEERLEKE 
LKSIIGHGFAVIYLISHKLVKKSLDDGYLVGSRGSVGSSFVATMTEITEV 900 
NPLPPHYVCPNCKHSEFFNDGSVGSGFDLPDKNCPRCGTKYKKDGHDIPF 
ETFLGFKGDKVPDIDLNFSGEYQPRAHNYTKVLFGEDNVYRAGTIGTVAD 1000 
KTAYGFVKAYASDHNLELRGAEIDLAAGCTGVKRTTGQHPGGIIWPDYM 
EIYDFTPIQYPADDTSSEWRTTHFDFHSIHDNLLKLDILGHDDPTVIRML 1100 
QDLSGIDPKTIPTDDPDVMGIFSSTEPLGVTPEQIMCNVGTIGIPEFGTR 
FVRQMLEETRPKTFSELVQISGLSHGTDVWLGNAQELIQNGTCTLSEVIG 1200 
CRDDIMVYLIYRGLEPSLAFKIMESVRKGKGLTPEFEAEMRKHDVPEWYI 
DSCKKIKYMFPKAHTU^YVLMAVRI AYFKVHHPLLYYAS YFTVRAEDFDL 1300 
DAMIKGS PAI RKRI EE INAKGIQATAKEKSLLTVLEVALEMCERGFS FKN 
IDLYRSQATEFVIDGNSLIPPFNAIPGLGTNVAQAIVRAREEGEFLSKED 14 00 
LQQRGKLSKTLLEYLESRGCLDSLPDHNQLSLF 
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